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E horas sem conta passo, mudo, 
O olhar atento, 

A trabalhar, longe de tudo 

O pensamento. 


Porque o escrever - tanta pericia, 
Tanta requer, 

Que oficio tal.., nem ha noticia 
De outro qualquer. 


Profissao de Fé (excerto) Olavo Bilac 


Preface 


This is the final volume of a series of three volumes (the other ones being [9] and 
[8]) devoted to the mathematics of mathematical olympiads. Generally speaking, 
they are somewhat expanded versions of a collection of six volumes, first published 
in Portuguese by the Brazilian Mathematical Society in 2012 and currently in its 
second edition. 

The material collected here and in the other two volumes is based on course 
notes that evolved over the years since 1991, when I first began coaching students 
of Fortaleza to the Brazilian Mathematical Olympiad and to the International Math- 
ematical Olympiad. Some ten years ago, preliminary versions of the Portuguese 
texts also served as textbooks for several editions of summer courses delivered at 
UFC to math teachers of the Cape Verde Republic. 

All volumes were carefully planned to be a balanced mixture of a smooth and 
self-contained introduction to the fascinating world of mathematical competitions, 
as well as to serve as textbooks for students and instructors involved with math clubs 
for gifted high school students. 

Upon writing the books, I have stuck myself to an invaluable advice of the 
eminent Hungarian-American mathematician George Pélya, who used to say that 
one cannot learn mathematics without getting one’s hands dirty. That’s why, in 
several points throughout the text, I left to the reader the task of checking minor 
aspects of more general developments. These appear either as small omitted details 
in proofs or as subsidiary extensions of the theory. In this last case, I sometimes 
refer the reader to specific problems along the book, which are marked with an * 
and whose solutions are considered to be an essential part of the text. In general, 
in each section I collect a list of problems, carefully chosen in the direction of 
applying the material and ideas presented in the text. Dozens of them are taken from 
former editions of mathematical competitions and range from the almost immediate 
to real challenging ones. Regardless of their level of difficulty, generous hints, or 
even complete solutions, are provided to virtually all of them. 

As a quick look through the Contents pages readily shows, this time we con- 
centrate on combinatorics, number theory, and polynomials. Although the chapters’ 
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names quickly link them to one of these three major themes, whenever possible or 
desirable later chapters revisit or complement material covered in earlier ones. We 
now describe, a bit more specifically, what is covered within each major topic. 

Chapters | through 5 are devoted to the study of basic combinatorial techniques 
and structures. We start by reviewing the elementary counting strategies, emphasiz- 
ing the construction of bijections and the use of recursive arguments throughout. We 
then go through a bunch of more sophisticated tools, as the inclusion-exclusion prin- 
ciple and double counting, the use of equivalence relations, metrics on finite sets, 
and generating functions. Turning our attention to the existence of configurations, 
the pigeonhole principle of Dirichlet and invariants associated with algorithmic 
problems now play the central role. Our tour through combinatorics finishes by 
studying some graph theory, all the way from the basic definitions to the classical 
theorems of Euler (on Eulerian paths), Cayley (on the number of labeled trees), and 
Turan (on complete subgraphs of a given graph), to name just a few ones. 

We then turn to elementary number theory, which is the object of Chaps. 6-12. 
We begin, of course, by introducing the basic concepts and properties concerned 
with the divisibility relation and exploring the notion of greatest common divisor 
and prime numbers. Then we turn to diophantine equations, presenting Fermat’s 
descent method and solving the famous Pell’s equation. Before driving through a 
systematic study of congruences, we make an interlude to discuss the basics of 
multiplicative arithmetic functions and the distribution of primes, these two chapters 
being almost entirely independent of the rest of the book. From this point until 
Chap. 12, we focus on the congruence relation and its consequences, from the very 
beginnings to the finite field Zp, primitive roots, Gauss’ quadratic reciprocity law, 
and Fermat’s characterization of integers that can be written as the sum of two 
squares. All of the above material is, here more than anywhere else in the book, 
illustrated with lots of interesting and challenging examples and problems taken 
from several math competitions around the world. 

The last nine chapters are devoted to the study of complex numbers and 
polynomials. Apart from what is usually present in high school classes—as the 
basics of complex numbers and the notion of degree, the division algorithm, and 
the concept of root for polynomials—we discuss several nonstandard topics. We 
begin by highlighting the use of complex numbers and polynomials as tags in 
certain combinatorial problems and presenting a complete proof of the fundamental 
theorem of algebra, accompanied with several applications. Then, we study the 
famous theorem of Newton on symmetric polynomials and the equally famous 
Newton’s inequalities. The next theme concerns interpolation of polynomials, when 
particular attention is placed on Lagrange’s interpolation theorem. Such a result 
is used to solve linear systems of Vandermonde with no linear algebra, which in 
turn allows us to, later, analyze an important particular class of linear recurrence 
relations. The book continues with the study of factorization of polynomials over Q, 
Z, and Z,, together with several interesting problems on irreducibility. Algebraic 
and transcendental numbers then make their appearance; among other topics, we 
present a simple proof of the fact that the set of algebraic numbers forms a field 
and discuss the rudiments of cyclotomic polynomials and transcendental numbers. 
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The final chapter develops the most basic aspects of complex power series, which 
are then used, disguised as complex generating functions, to solve general linear 
recurrence relations. 

Several people and institutions contributed throughout the years for my efforts 
of turning a bunch of handwritten notes into these books. The State of Ceara 
Mathematical Olympiad, created by the Mathematics Department of the Federal 
University of Ceara (UFC) back in 1980 and now in its 37th edition, has since 
then motivated hundreds of youngsters of Fortaleza to deepen their studies of 
mathematics. I was one such student in the late 1980s, and my involvement with this 
competition and with the Brazilian Mathematical Olympiad a few years later had a 
decisive influence on my choice of career. Throughout the 1990s, I had the honor 
of coaching several brilliant students of Fortaleza to the Brazilian Mathematical 
Olympiad. Some of them entered Brazilian teams to the IMO or other international 
competitions, and their doubts, comments, and criticisms were of great help in 
shaping my view on mathematical competitions. In this sense, sincere thanks go 
to Joao Luiz de A. A. Falc4o, Roney Rodger S. de Castro, Marcelo M. de Oliveira, 
Marcondes C. Franga Jr., Marcelo C. de Souza, Eduardo C. Balreira, Breno de A. A. 
Falcao, Fabricio S. Benevides, Rui F. Vigelis, Daniel P. Sobreira, Samuel B. Feitosa, 
Davi Maximo A. Nogueira, and Yuri G. Lima. 

Professor Joao Lucas Barbosa, upon inviting me to write the textbooks to the 
Amilcar Cabral Educational Cooperation Project with Cape Verde Republic, had 
unconsciously provided me with the motivation to complete the Portuguese version 
of these books. The continuous support of Professor Hilario Alencar, president of 
the Brazilian Mathematical Society when the Portuguese edition was first published, 
was also of great importance for me. Special thanks go to my colleagues— 
professors Samuel B. Feitosa and Fernanda E. C. Camargo—who read the entire 
English version and helped me improve it in a number of ways. If it weren’t for 
my editor at Springer-Verlag, Mr. Robinson dos Santos, I almost surely would not 
have had the courage to embrace the task of translating more that 1500 pages from 
Portuguese into English. I acknowledge all the staff of Springer involved with this 
project in his name. 

Finally, and mostly, I would like to express my deepest gratitude to my parents 
Antonio and Rosemary, my wife Monica, and our kids Gabriel and Isabela. From 
early childhood, my parents have always called my attention to the importance of 
a solid education, having done all they could for me and my brothers to attend the 
best possible schools. My wife and kids fulfilled our home with the harmony and 
softness I needed to get to endure on several months of work while translating this 
book. 


Fortaleza, Brazil Antonio Caminha Muniz Neto 
December 2017 
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Chapter 1 ®) 
Elementary Counting Techniques sei 


In this first chapter, we develop the usual elementary tools for counting the number 
of distinct configurations corresponding to a certain combinatorial situation, without 
needing to list them one by one. As the reader will see, the essential ideas are the 
construction of bijections and the use of recursive arguments. 

Although we develop all material from scratch, the reader is expected to have 
some previous experience with elementary counting techniques, and is warned that 
the material collected here can be somewhat terse at places. 


1.1 The Bijective Principle 


In all that follows, we assume that the reader has a relative acquaintance with sets 
and elementary operations on them. Given n € N, we let /,, denote the set 


n=WiEN 1<j<nj 


of natural numbers from | to n. 

A set A is finite if A = @ or if there exists a bijection f : I, — A, for some 
néN.IfA F Gis finite and f : J, — A is a bijection, then letting a; = f(j) we 
write A = {a,,...,d,} and say that n is the number of elements of A (Problem 1| 
shows that this is a well defined notion). Also in this case, we write 


|A| =n or #A=n 


to mean that A has n elements. For the sake of completeness, we say that @ has 0 
elements and write || = 0. 
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The elementary theory of counting (configurations) has its foundations in the 
following simple proposition, to which we will systematically refer as the bijective 
principle. 


Proposition 1.1 [f A and B are nonempty finite sets, then |A| = |B| if and only if 
there exists a bijection f : A > B. 


Proof First of all, assume that there exists a bijection f : A > B.If |A| =n, we 
can take a bijection g : J, — A, so that fog: I, —> Bis also a bijection. Hence, 
|B| =n. 

Conversely, suppose that |A| = |B| = n, with bijections g : I, — A and 
h:In > B.Thenhog !: A > Bisa bijection from A to B. oO 


The following consequence of the bijective principle is sometimes referred to as 
the additive principle of counting. Before we state it, we recall that two sets A and 
B are said to be disjoint if AN B = J. 


Proposition 1.2 [f A and B are finite disjoint sets, then 
|AU B| = |A| + |BI. 


Proof Exercise (see Problem 2). oO 


A typical application of the former proposition consists in counting the number of 
different ways of choosing exactly one object out of two possible distinct kinds, such 
that there is a finite number of possibilities for each kind of object. In the statement 
of the proposition, the kinds correspond to the disjoint sets A and B, whereas the 
possibilities for each kind correspond to the elements of A and B. 

An easy induction allows us to generalize the additive principle for n finite 
pairwise disjoint sets. This is the content of the coming 


Corollary 1.3 If Aj, A2,..., An are finite pairwise disjoint sets, then 


n 


n 
#U4;=oiAjl. 
i 


j=l 


Proof Exercise (see Problem 3). oO 


For what comes next, given two sets A and B, we let A \ B denote the set 
A\ B={x eA; x ¢ B} 


and say that A \ B is the difference between A and B, in this order. If B C A, and if 
no danger of confusion arises, we shall sometimes refer to A \ B as the complement 
of B in A, in which case we denote it as B, instead of A \ B. 

The simple formula of the next corollary, which counts the number of elements 
of the complement of a subset of a finite set, is an additional consequence of the 
additive principle. 
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Corollary 1.4 /f A is a finite set and B C A, then 
|B| = |A| —|A\ BI. 
Proof Since A = B U (A \ B), a disjoint union, the additive principle gives 
|A] = |BU(A\ B)| = |B] + |A\ Bl. 


oO 


The general philosophy behind the use of the previous corollary in problems of 
counting is this: suppose we wish to count the number of elements of a certain finite 
set B, but do not know how to do it directly. An interesting strategy is to search for 
a finite set A > B such that we know how to count both |A| and |A \ B|. Then, we 
apply the formula of the corollary to compute the desired number of elements of B. 
Some concrete examples of this situation will be found in what is to come. 

Our next result generalizes Proposition 1.1 and Corollary 1.4, computing the 
number of elements of a union of two finite sets. Formula (1.1) below is known as 
the principle of inclusion-exclusion for two finite sets, and will be generalized in 
Sect. 2.1 (cf. Theorem 2.1). 


Proposition 1.5 [f A and B are finite sets, then 
|AU B| = |A| + |B] —|AN Bl. (1.1) 


Proof Since A and B \ A are finite, disjoint and such that AU B = AU (B \ A), the 
additive principle gives 


|AU B| = |AU(B\ A)| = |A|+|B \ Al. 
On the other hand, we also have the disjoint union 
B=(B\A)U(ANB), 
so that, again from the additive principle, |B| = |B \ A| + |ANMB]|. Then, |B \ A| = 
|B| — |A NM B|, and once we plug this formula into the above relation for |A U B| 
we get 
|AU B| = |A| + (B| — |AN B)). 
| 


For what comes next, recall that the cartesian product of sets A and B is the 
set A x B whose elements are the ordered pairs (a, b) witha € A andb € B.In 
mathematical symbols, 


Ax B={(a,b);a€A,b€ B}. 
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It is also worth recalling that ordered pairs possess the following important property: 
if (a,b), (c,d) € A x B, then 


(a,b) = (c,d) @a=c and b=d. 


In this respect, see Problem 5. 
The coming result, together with its subsequent corollary, are known as the 
multiplicative principle or as the fundamental principle of counting. 


Proposition 1.6 [f A and B are nonempty finite sets, then A x B is also finite, with 
|A x B] = |A|- |B. 


Proof Writing B = {y1,..., Ym} = Ujuitys} it follows from Problem 6 that 


AxB=Ax Ubon =x by), 


j=l j=l 


a disjoint union. Hence, Corollary 1.3 gives 
m m 
IA x Bl =|J(Ax ty)] = 1A x byl (1.2) 
j=l j=l 


Now, since f : A — A x {y;} given by f(x) = (x, yj) is a bijection (with 
inverse g : A x {y;} — A given by g(x, y;) = x), the bijective principle guarantees 
that |A| = |A x {y;}| for 1 < j < m. Hence, it follows from (1.2) that 


m 
|A x B] = 5° |A| = |A|-m = |A]- |B]. 
j=l 


Oo 


In applications, we ought to invoke this version of the fundamental principle of 
counting whenever we need to choose two objects simultaneously, such that one 
of the objects is of one of two possible kinds and the other object is of the other 
kind (and each kind comprises a finite number of possibilities). In the statement of 
the proposition, the two possible kinds correspond to the sets A and B, whereas the 
possibilities for each kind correspond to the elements of A and B. 

It is time we look at a concrete example. 


Example 1.7 How many natural numbers have two distinct nonzero algarisms, both 
less that or equal to 5? How many of them are such that the first algarism is smaller 
that the second one? 
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Solution Let A = {1, 2,3, 4,5}. The first part of the problem is clearly equivalent 
to counting how many ordered pairs (a, b) € A x A are such that a 4 b; the second 
part requires that a < b. 

Let us start counting the number of ordered pairs (a,b) € A x A, without 
additional restrictions. Since this is the same as counting the number of elements of 
A x A, the multiplicative principle gives 5 x 5 = 25 possible pairs (a, b). In order 
to count how many of them are such that a ¥ b, let us use Corollary 1.4: the number 
of such pairs is obtained by discounting, from the total number of pairs, those for 
which a = b. Since there are 5 pairs (a, a) with a € A, we conclude that there are 
25 — 5 = 20 pairs (a, b) witha # b. 

For what is left, note that (a, b) is an ordered pair such that a < b if and only 
if (b, a) is an ordered pair for which b > a; in other words, the correspondence 
(a, b) +> (b, a) is a bijection between the set whose elements are the ordered pairs 
(a, b) € Ax A such that a < b and that formed by the ordered pairs (a’, b') € Ax A 
for which a’ > b’. Hence, these sets have the same number of elements (namely, 
ordered pairs); since they are disjoint and their union equals the set of pairs (a, b) € 
A x A such that a # b, it follows from Proposition 1.2 that the desired number of 
ordered pairs is “ = 10. oO 


For the subsequent discussion, we need to extend the concept of cartesian product 
to an arbitrary finite number of finite nonempty sets. To this end, given finite 


nonempty sets Aj, Az, ..., An, let’s define their cartesian product A; x A2 x 
--+ x Aj, as the set of sequences (to which we shall sometimes refer to as n-tuples) 
(a1, 42,..., 4), Such that ay € Aj, a2 € A2,..., dn € An.! 


Corollary 1.8 If A,, A2,..., An are finite nonempty sets, then 


n 
|Al x Ag x +++ x Anl =] IAjl. 
j=l 
Proof Exercise (see Problem 7). oO 
The next corollary brings an important elaboration of the multiplicative principle. 


Corollary 1.9 Let A,, A2,..., Ax be finite nonempty sets with |A,| = nj, |A2| = 
n2,..., |Ag| = nz. Then, there are exactly njn2...ng sequences (aj, a2,..., Ak) 
witha; € Aj forl <j <k. 


'In view of this definition, in principle we have two distinct definitions for the elements of A x B: 
on the one hand, they consist of the ordered pairs (a, b) such that a € A and b € B; on the other, 
they are sequences (a,b) for which a € A and b € B. Since the ordered pair (a, b) is defined 
by (a, b) = {{a}, {a, b}} (cf. Problem 5) and the sequence (a, b) (with a € A and b € B) is the 
function f : {1,2} —~ AUB such that f(1) := a € A and f(2) := b © B, we come to the 
conclusion that, although we have been using the same notation, they are distinct mathematical 
objects. However, for our purposes the identification of the ordered pair (a, b) to the sequence 
(a, b) is totally harmless and will be done, from now on, without further comments. 
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Proof As we have told above, a sequence (a, a2,..., ax) such that aj € Aj, a2 € 
A2,...,a © Ax is nothing but an element of the cartesian product A; x Az x--- x 
Ax. Hence, the number of such sequences equals the number of elements of A; x 
A2 xX +--+ x Ag, which in turn equals, by the previous corollary, |Aj||A2|...|Ax| = 
njn2...Nk. Oo 


In words, the above corollary furnishes a method for counting in how many 
(distinct) ways we can choose k objects in order but independently, with the first 
object being of type 1, the second being of type 2, ..., the k-th being of type k and 
having at our disposal one or more possibilities for the choice of each type of object. 
Here again, the distinct types of objects correspond to the sets Aj,..., Ag, whereas 
the numbers of possibilities for each type of object correspond to the elements of 
the sets under consideration. 

As a special case of the situation of Corollary 1.9, we can count in how many 
ways it is possible to choose, in order but independently, k elements out of a set 
of n elements, with repeated choices being allowed. More precisely, we have the 
following 


Corollary 1.10 /f |A| = n, then there are exactly n* sequences of k terms, all 
chosen from the elements of A. 
Proof Make Aj =--- = Ax = A in Corollary 1.9. oO 


In other words, we say that the above corollary counts how many arrangements 
with repetition of k objects, chosen out of a set with n objects, there exist. 

Yet another way of looking at the result of Corollary 1.10 is to start by recalling 
that a sequence of k terms, all belonging to J,, is simply a function f : Ik > In; 
hence, what we established in that corollary was the fact that the number of distinct 
functions f : I, — Ip is exactly n*. Later, we shall compute how many of such 
functions are injective and how many are surjective. 

The coming example applies the circle of ideas above to a concrete situation. For 
its solution, the reader might find it helpful to recall the criterion of divisibility by 3, 
which will be derived in Problem 1, page 162: the remainder of a natural number n 
upon division by 3 equals that of the sum of its algarisms. 


Example 1.11 Compute the quantity of natural numbers n, of ten algarisms and 
satisfying the following conditions: 


(a) n does not end in 0. 
(b) 7 is divisible by 3. 


Solution If n = (aja2...a9a,9) the decimal representation of n, then m = 
(aja2...dg) is a natural number of nine algarisms and ajg € {1,2,3,..., 9}. 
Now, for a fixed m = (a,a2...a9), the criterion of divisibility by 3 assures 
that each of the numbers (aja2...a91), (aja2...a94) and (ajd2...a97) leave 
the same remainder upon division by 3, the same happening with the numbers 
(aja2...dg2), (ajaz2...a95) and (ajaz...d98), as well as with the numbers 
(aja2...d93), (aja2...da96) and (a,a2...a99). Moreover, letting 1, rg and 73 
denote such common remainders, then r;, r2 and r3 are pairwise distinct. 
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Hence, for each natural number m = (a\a2...d9) of nine algarisms, we have 


exactly three natural numbers n = (a,a2...d9a19) which are divisible by 3. Since 
a, € {1,2,...,9} and az,...,a9 € {0,1,2,...,9}, Corollary 1.9 guarantees that 
the total of numbers we want to compute is 9 x 108 x 3 = 27 x 108. Oo 


Problems: Sect. 1.1 


1. 


* Prove that the notion of number of elements of a nonempty finite set is a well 
defined concept. More precisely, prove that there exists a bijection f : Ij, > In 
if and only ifm =n. 


. * Prove the additive principle of counting from the definition for the number of 


elements of a finite set. More precisely, prove that if A and B are nonempty, 
finite disjoint sets, with |A| = m and |B| = n, then there exists a bijection 
filntn > AUB. 

* Prove Corollary 1.3. 

* Let A and B be nonempty finite sets, with |A| = |B]. Prove that a function 
f :A— Bis injective if and only if it is surjective. 

* Given sets A and B and elements a € A, b € B, we formally define the 
ordered pair (a, b) by letting? 


(a, b) = {{a}, {a, b}}. 


Use this definition to show that, for a,c € A and b,d € B, one has (a, b) = 
(c,d) @a=candb=d. 
* Given nonempty sets A, Bi,..., By, prove that 


Ax (U Bj) = Lia x Bj). 
j=l j= 


Moreover, if B,,..., By, are pairwise disjoint, prove that A x By,..., A x Br 
are also pairwise disjoint. 


. * Prove Corollary 1.8. 


For the next problem, we define an ordered partition of a set A as a 


sequence (Aj,..., Ax) of subsets of A satisfying the following conditions: (i) 
A = A, U...U Ag; (ii) Al, ..., Ag are pairwise disjoint. In this case, we use 
to say that Aj, ..., Ax (in this order) form an ordered partition of A into k of 


its subsets. 
Let n, k € Nand A be a finite set with n elements. Show that there exist exactly 
k” ordered partitions of A into k subsets Aj, ..., Ag. 


Such a definition is due to Kazimierz Kuratowski, Polish mathematician of the twentieth century. 
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9. In the cartesian plane, a pawn moves according to the following rule: being at 
the point (a, b), he can go to one of the points (a+ 1,b+1),(a+1,b-— 1), 
(a—1,b+1) or (a—1, b — 1). If the pawn starts at the point (0, 0), how many 
distinct positions can he occupy after his first n moves? 

10. (AIME—adapted) Given k,n € N, do the following items: 


(a) For a given integer 1 < j <n, prove that there are exactly (n — j + 1)* 
sequences formed by k elements of J, (possibly with repetitions) such that 
the smallest term of the sequence is greater than or equal to /. 

(b) For a given integer 1 < j <n, prove that there are exactly (n — j + 1)‘ — 
(n — j)* sequences formed by k elements of J, (possibly with repetitions), 
such that the smallest term of the sequence equals j. 

(c) Prove that the sum of the smallest terms of all nk sequences of k elements 
of I, (possibly with repetitions) equals 1 + 2* +... + nk. 


11. (France) Letk € N, A = {1, 2,3, ee and X be a subset of A satisfying 
the following condition: if x € X, then 2x ¢ X. Find, with proof, the greatest 
possible number of elements of X. 


1.2 More Bijections 


In this section, with the elementary results of the previous section at our disposal, we 
present some instances of deeper applications of the bijective principle to establish 
the equality of the numbers of elements of two finite sets. 

We start by computing, in two different ways, the number of subsets of a finite 
set. If A is any set, we let P(A) denote the power set of A, i.e., the family 


P(A) ={B; BCA}. 


Given finite sets A and B, both with n elements, a bijection f : A — B naturally 
induces a bijection f : P(A) — P(B), defined for C C A by 


F(C) ={f@); x € Ch. 


In words, given a subset C of A, we let f(C) be the subset of B whose elements are 
the images of the elements of C by /. In particular, note that 


f@) ={f); x € BD} =9, 


since it is impossible to choose x € @. 


3In Set Theory, a family is a set whose elements are also sets. 
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Hence, as a corollary to the bijective principle, if A and B are nonempty finite 
disjoint sets, then 


|A] = |B] = |P(A)| = |P(B)I. (1.3) 


We shall also need another concept, which will be useful in future discussions. 


Definition 1.12 Let A be a nonempty set. For B C A, the characteristic function 
of B (with respect to A) is the function xg : A — {0, 1}, defined by 


0, ifx ¢B 


: 1.4 
lifxeB on 


XB(x) = 


For example, if A = {a,b,c,d,e, f,g} and B = {c, f, g}, then xg is the 
function from A to {0, 1} such that 


xB(a) = xB(b) = xB(@) = xB(e) =0 and xg(c) = xa(f) = xa(g) = 1. 


For what follows, if A = {a),...,d,} is a set with n elements, whenever 
convenient we can associate to A the sequence (a1, ..., a,); whenever we do that, 
we shall say that (a), ..., d,) is an ordering of (the elements of) A or, sometimes, 
the natural ordering of A. 

If B Cc A, a (combinatorially) more interesting way of looking at the char- 
acteristic function xg of B with respect to A is to consider it as a sequence of 
0’s and 1’s, with the positions of the 1’s (with respect to the natural ordering of 
A) corresponding to the elements of B. For instance, let A = {a,b,c,d,e, f, g}, 
furnished with the ordering induced from the lexicographical (i.e., alphabetical) 
order; if B = {c, f, g}, then the characteristic function of B corresponds to the 
sequence (0, 0, 1, 0,0, 1, 1). 


More generally, let A = {a,,...,a,}, furnished with the natural ordering. For 
B CA, the characteristic sequence of B in A is the sequence sg = (@,..., Qn), 
such that 
0, if aj ¢ B 
= . 1.5 
" eee _— 


We can finally compute the number of subsets of a set with n elements. 
Theorem 1.13 A set with n elements has exactly 2” subsets. 


Proof Let A = {aj,...,@,} be a set with n elements, furnished with its natural 
ordering, and let S be the set of sequences of n terms, formed by the elements of the 
set {0, 1}. By Corollary 1.10, we have |S| = 2”. We will show that |P(A)| = 2”, 
and to this end it suffices to show that the function 


f:P(A)— S 
B +> sB 
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(i.e., the function which associates to each subset of A its characteristic sequence 
with respect to A) is a bijection. For what is left to do, check that the function 
g:S— P(A), given by 


8(Q1,..., An) = {aj € A; aj = I}, 


is the inverse of f. oO 


It is instructive to note that we can give another proof of the previous result, this 
time relying more directly on the bijective principle. Indeed, by (1.3) it suffices to 
show that the set A = {0, 1,..., — 1} has precisely 2” subsets. To this end, we let 


f = PUA) = 10 1,2,...,2" = 1} 


be given by f(@) = 0 and, for @ 4 B = {ay,...,ax} C A, withO < a, <--- < 
a <n—I, 


f(B) = 2 4-0 + 2%, 
Since 
(294 eee oP 4 eek SK 


f is well defined. To conclude that it is a bijection, recall Example 4.12 of [8], which 
shows that every integer 1 < m < 2” — 1 can be written, in a unique way up to the 
order of the summands, as a sum of distinct powers of 2 (obviously, none of these 
can be bigger that 2”~!); such a way of writing m is called its binary representation. 
Thus, the uniqueness of the binary representation is equivalent to the injectivity of 
jf, whereas the existence of such a representation is equivalent to the surjectivity of 
f. 

Notice how the above proof highlights the strength of the bijective principle in 
counting problems. Let us see two more examples. 


Example 1.14 Let n be a natural number of the form 4k + 1 or 4k + 2, for some 
nonnegative integer k. Prove that J, contains exactly 2”! subsets with even sum of 
elements (and under the convention that the sum of the elements of the empty set is 
0). 


Proof Let Fo and F; be the families of the subsets of J, with sum of elements 
respectively even and odd. Since Fo and Fj are disjoint and such that Fo U Fy = 
P(A), it follows from the additive principle and Theorem 1.13 that 

|Fol + Fil = [P(A)| = 2". 


Thus, if we show that |Fo| = |F1|, we will get |Fo| = |F\| = 2"7!. 
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To what is left to do, let’s construct a bijection between Fo and F,. To this end, 
consider the function 


f : P(A) — P(A) 
Bo w> Be? 
where B° = A \ B denotes the complement of B in A. Since (B°)* = A\(A\ B) = 
B, we have f(f(B)) = B for every B C A; hence, f o f = Idp,,), so that f isa 
bijection (cf. Example 6.40 of [8]). Now, the idea is to show that, ifm = 4k + 1 or 
n = 4k + 2, then f induces a bijection between Fo and Fj. 
Given B C A, we have 


n 


xt Dreyer 


xeEB xeBCe xeEA x=1 


= (4k + 1)(2k +1) or (2k + 1)(4k + 3), 


according to whether n = 4k + | orn = 4k + 2. In any case, 


Yixt ae: 


xeB xEBC 


is an odd number, so that the sums of the elements of B and of B° have distinct 
parities; in symbols, 


Be Fos Boe F,. 


Therefore, the restriction g of f to Fo applies Fo into F;, whereas the restriction 
h of f to F, applies F; into Fo. However, since g and h are clearly inverses of 
each other (for, f = f~'), it follows that g and h are also bijections, so that 
|Fo| = |Fil. Oo 


The bijective principle is particularly useful to understand the properties of the 
partitions of a natural number n. Here and in all that follows, a partition of the 
natural number 7 is a way of writing n as a sum of (one or more) not necessarily 
distinct natural summands. For example, the distinct partitions of 4 are 


4=14+3=14+14+2=14+14141=2+2. 


The coming example is due to L. Euler. 


Example 1.15 (Euler) Prove that the number of partitions of a natural n in odd 
summands equals the number of partitions of n in distinct summands. 


Proof Letting Z, denote the set of partitions of n in odd summands and D,, the set 
of partitions of n in distinct summands, it suffices to construct a bijection f : Z, > 
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D,,. To understand how to define f, consider the following partition of 51 in odd 
summands: 


1+14+34+34+343434+3+4+5454+74+74+47. 
Then, we can also write 
51 =2-14+6-342-5+43-7 
SO Ee eS 7 
=2!.142?-342'.342!.542'-742°-7 
=2+12+6+10+14+7, 
which is a partition of 51 in distinct summands. In what follows, we shall make the 


above particular case into a general argument to get the desired bijection. 
A partition P € T,, is 


na ltr FLE3 He +3454 0 $547 4H tT$, 
a ee 


a a3 as aq 


where a1, 43, a5,... > O and, from some natural number k on, the number a2; of 
summands equal to 2k + 1 is 0. Then, we can write 


n=a,-l+a3-3+a5-5+a7-7+-:-. 


From here, in order to get a partition f(P) of n into distinct summands, substitute 
each positive coefficient 42,41 by its binary representation (cf. Example 4.12 de [8]) 
and, if 2! is one of the summands of such a representation, let 2! (2k + 1) be one of 
the summands in f(P) (you can easily notice that this is exactly what we did in the 
particular case of the partitions of 51). 

By systematically proceeding this way, we claim that we get a partition of n into 
distinct summands. Indeed, any two summands of f(P) can be written as 


2/1 (2k; +1) and 22(2k. + 1): 


if kj ~ k2, then such summands are clearly different from one another; if kj = 
ko, then we must have J; ¥ lo, for 2 and 22 are two summands of the binary 
representation of a2,,+1 = 42%,+1. In any case, f is well defined. 

To prove that f is a bijection, let’s define a function g : D, — TZ, (which will be 
the inverse of f) in the following way: let Q be the partition 


n= (my +m +--+ + my) + (31 +132 +--+ +32) 
b (migy 1 ittsa 2b 535) “Fo 
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of n into distinct summands, where we have grouped, within each pair of 
parentheses, all summands with a single odd part. More precisely, what we are 
saying is that m2x41,; = hi (2k + 1), for some nonnegative integer /;. Then, 


morepi ibe + mre 15% = 2K +1) +++ +2! 2k + LD) 
= (2 + .+- 4+ 2) (2k + 1), 


so that, letting a2,41) = 2h 4...42%, we get 
n=a,-l+a3-3+a5-5+a7-7+--:-. 
We thus define g(Q) to be the partition 


na ltr FLE3 H+ 34540 $547 $0 t7$, 
ee — eee 


a a3 a5 ay 


so that g(Q) € Ty. 
Finally, it is immediate to verify that f and g are inverses of each other. Oo 


Problems: Sect. 1.2 


. Compute the number of subsets of the set J, which contain n. 

. * (APMO) We are given a natural number n and a finite nonempty set A. Show 
that there are exactly (2” — 1)/4! sequences (A1, A2,..., An), formed by subsets 
of A such that A = A; UA2U...U Ap. 

3. (Soviet Union) We are given n straight lines in the plane, in general position, 1.e., 
such that no two of them are parallel and no three of them pass through the same 
point. Compute the number of regions into which these lines divide the plane. 

For the next problem, we say that a family F of subsets of /, is an intersecting 
system if, for every distinct A, Be F, wehave ANB FAV. 

4. (Bulgaria—adapted) Let F be an intersecting system in J;,. 


NOR 


(a) Give an example of such an F in which |F| = 2”~!. 
(b) Prove that |F| < 2”~!, for any such F. 


5. * Givenn €N, let Z, be the family of subsets of J, with odd numbers of 
elements. If A = {x1,x2,...,xX2¢41} € Tn, with x1 < x2 < +++ < x2K44, 
we say that x,41 is the central element of A, and write xx41 = c(A). In this 
respect, do the following items: 


(a) For 1 < x1 < x2 <--+ < x2%41 <n, show that the correspondence 


(x1, %2,.-.,X2e¢1} (n+ 1—xoe41,...,n +1—x2,n+1— x4} 
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establishes a bijection between the sets in Z, having central elements 
respectively equal toi andn + | —i. 

(b) In Problem 5, page 20, we will show that |Z,,| = 2-1 Use the result of item 
(a), together with this fact, to show that 


Y> c(A) = (n $1) - 2°. 


AeTy 


6. (TT) Given n € N, we define the diversity of a partition of n as the number of 
distinct summands in it. Also, let p(”) stand for the number of distinct partitions 
of n and q(n) for the sum of the diversities of all of the p(n) partitions of n. 
Prove that: 


(a) qa) = 1+ pd)+ p@)+-:-+ pa—1). 
(b) q(n) < V2np(n). 


1.3. Recursion 


The general philosophy behind recursive counting is the following: we wish to count 
the number a, of elements of a set Ay, which is declared by means of some specific 
rule(s) concerning the natural number n. We do this in a two-step process: firstly, 
we use the declaration of A, to get a recurrence relation for the sequence (dy )n>1, 
1e., arelation of the form 


ay = F(a,...,4n-1,0), (1.6) 


where F is some function of n variables; secondly, we use the recurrence relation, 
together with algebraic and/or analytical arguments, to compute or estimate a, in 
terms of n. 

In this section we concentrate our efforts in the first step above, by examining 
in detail some specific examples ranging in difficulty from almost trivial to real 
challenging. As for the second step, for the time being we assume that the reader is 
familiar with the solution of linear recurrence relations of order at most three and 
with constant coefficients, as presented in Chapter 3 of [8]. Chapters 3 and 21 will 
develop more powerful tools for the treatment of more general recurrence relations. 

We start with a recursive counting for the number os subsets of a finite set. 


Example 1.16 If A is a set with n elements, then A has exactly 2” subsets. 


Proof Let a, be the number of subsets of A (here, we are implicitly using (1.3) 
when we write |A| as a function of n). For a fixed x € A, there are two kinds of 
subsets of A: those which contain x and those which do not. 
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If B is a subset of A containing x, then B \ {x} is a subset of A \ {x}; conversely, 
if B’ is a subset of A \ {x}, then B’ U {x} is a subset of A containing x. Since 
such correspondences are clearly inverses of each other, we conclude that there are 
as many subsets of A containing x as there are subsets of A \ {x}; however, since 
|A \ {x}| = n — 1, it follows that there are exactly a,_; of such subsets of A. On 
the other hand, since the subsets of A not containing x are precisely the subsets of 
A \ {x}, we have exactly a,_ subsets of this second kind. 

The argument of the previous paragraph clearly gives a, = 2a,—, for every 
n > 1. Since we obviously have a; = 2, the formula for the general term of a GP 
gives 


Gn = 41 - gn-l = 2". 


Our next example provides a recursive approach to Problem 3, page 13. 


Example 1.17 (Soviet Union) We are given n straight lines in the plane, in general 
position, i.e., such that no two of them are parallel and no three of them pass through 
the same point. Compute the number of regions into which these lines divide the 
plane. 


Solution Let a; be the number of regions into which the plane gets divided by some 
k straight lines in general position. We trace out one more straight line, say r, such 
that the k + 1 resulting lines are also in general position. Since r intersects the k 
other lines in k distinct points, we conclude that r gets divided in k + | intervals by 
these k points. In turn, each one of these k + | intervals corresponds to exactly one 
region, from the a; regions we had, that r splits into two new regions. Therefore, 
when we trace out line r we extinguish k + 1 older regions and generate 2(k + 1) 
new regions. Thus, if az; denotes the total number of regions we get after tracing 
out line r, we conclude that 


Akt) = ae — (K+ 1) + 24+ 1) =a t+ (K+ 1). 


Finally, since aj = 2, the formula for telescoping sums (formula (3.15) of [8]) 
gives 


n—-1 n—-1 


Gn = 41+) (4x41 — a4) = 2+ DK +1) 


k=1 k=1 


=14 Dears ee. 
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The coming example is perhaps the most celebrated of all elementary applica- 
tions of recursive reasoning, being known as the Fibonacci problem.* 


Example 1.18 A couple of rabbits starts generating descendants when is at least 2 
months old. When this is so, it generates a new couple of baby rabbits per month. 
If we start with a couple of just-born baby rabbits at month one, how many couples 
will we have after 12 months? 


Solution Let F,, denote the total number of couples of rabbits after n months, so 
that F; = 1 and Fy = | (the first couple of descendants will be born only after 3 
months, so will be counted just in F3). 

Now, let’s look at the Fx42 couples of rabbits we will have after k + 2 months. 
There are two possibilities for such a couple: either it already existed after k + 1 
months, or else it was born at the (k + 2)-th month. The first possibility amounts, 
by definition, to a total of Fy, +1 couples. As for the second one, note that the couple 
of rabbits under consideration descends from one of the couples that already existed 
after k months; conversely, each couple that already existed after k months generates 
a new couple of descendants that will be born at the (k + 2)-th month, so that this 
gives an additional number of F; couples of rabbits after k + 2 months. 

Hence, the total numbers of couples of rabbits after k, k + 1 and k + 2 months 
are related by the recursive relation 


Fep2 = Fepit Fr, 
for every k > 1. Thus, 
F3=Fo+ Fi =2, Fya=Pot+h) =3,..., Fiz = 144. 


oO 


In the notations of the previous example, we recall that (F;,),>1 is known as the 
Fibonacci sequence. 


Example 1.19 Using the letters A, B and C, compute how many words of ten letters 
(not necessarily with a meaning) one can form, so that there are no consecutive 
consonants. 


Solution For n > 1, let a, be the number of words of n letters A, B, C satisfying 
the given condition on consonants. For k > 3, a word of k letters can finish in A, 
B or C. If it finishes in A, the k — 1 remaining letters form any of the a,_1 word 
of k — 1 letters and without consecutive consonants; if it finishes in B or C, then 


4 After Leonardo di Pisa, also known as Fibonacci, Italian mathematician of the eleventh century. 
Apart from its own contributions to Mathematics—as the problem we are presently describing— 
one of the greatest merits of Fibonacci was to help revive, in Middle Age Europe, the Mathematics 
of Classical Antiquity; in particular, Fibonacci’s famous book Liber Abaci introduced, in Western 
Civilization, the Hindu-Arabic algarisms and numbering system. 
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the next to last letter must necessarily be an A (since we cannot have consecutive 
consonants), and the k — 2 initial letters form any of the ag_2 words of k — 2 letters 
having no consecutive consonants. 

The reasoning of the previous paragraph gives us the recurrence relation 


Ok = Ag—| + 2ag_2, V k = 3. 
Since a; = 3 and az = S (the possible words of two letters and with no consecutive 


consonants are AA, AB, AC, BA and CA), we successively compute a3 = 11, 
a4 = 21; a= 43, a = 85, ay = 171, ag = 341, ag = 683 and aji0 = 1365. | 


As the two last examples suggest, there are several combinatorial situations, of 
distinctive recursive characters, that give rise to the algebraic problem of computing, 
in terms of n € N, the n-th term of a sequence (a,)y,>1 that satisfies a certain 
recurrence relation. The coming example, of professor Emanuel Carneiro, generates 
a third order linear recurrence relation. In this respect, perhaps the reader might find 
it helpful to pause for a moment and review the content of Section 3.3 of [8]. 
Alternatively, he/she can take a quick look at Problem 5, page 79, or at the material 
of Chap. 21 (especially Theorem 21.22.) 


Example 1.20 For each set A = {x1, X2,..., X} of real numbers, with x1 < x2 < 
- < Xm, let 


A(A)= Do xi- DD xy, 
k<j<m 1<j<k 


where k = @ 


¢ Given an integer n > 1, show that 


x A(A) = (n? +n 42) -2"73, 
DLAC In 


Solution Let d, = oe Ac1, 4(A). By examining some particular cases we easily 
get dj = 1, dz) = 4 and d3 = 14. In general, 


ioi= > AA) 


DAAC In+1 
= >> AA)+ DO AL) 
ACIn41 BAACIn (1.7) 
n+leA 
= \o AA) +a. 
ACIn41 


n+leA 
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To analyse the last sum above, let A’ = A \ {n + 1} C Jy, foreach A C In41 
such thatn + 1 € A. Writing P, and Z, for the families of subsets of 7, with even 
and odd numbers of elements, respectively, we have 


> A(A) = » A(A’ U{n + 1) + > A(A’U {n+ 1p). (1.8) 
tek A’EPn A'eL, 
n+le 


In Problem 5 (see also Problem 7, page 29) we shall prove that |P,| = |Zn| = 
2”-1. Assuming this fact for the time being, let’s separately consider the two sums 
at the right hand side of (1.8): 


(a) If A’ € Pp, it easily follows from the definition of A(-) that 
A(A'U {n+ 1) =@4+1)+ ACA, 
where A() = 0. However, since |P,,| = 2”~!, we get 


AG UwMt I= YP A) +2" at D. 
A’EPn DAA'EPy 


(b) If A’ € Ths say A= {x1, x2, ae -, XoK+1}, with x1 = AP es Ay then 


A(A’ U {n+ 1}) = A({xq, x2, ..., Xong1,n + 1) 
= (xep2 tes + x41 + (n+ D) 
— (41 + x2 +++ + XK41) 
= (nt 1) + eget +++ + X2K41) 
= eee ae) = 2 
= (n+ 1) + A(A’) = 2xq41. 


Note that x,41 € I, is the central element of A’ € T,,. Hence, letting c(A’) 
denote the central element of A’ € Z,, we obtain 


Do AGU fn +1) = DY +1) + A(4) — 2c(4)) 
AleT, AleTy, 


(n+ 1-2" 14+ S° AA) =2 YO (Ad. 


A’ETn A’eTy, 


In Problem 5, page 13 (see also Problem 18, page 31), we saw that 


> c(A’) = (n+ 1)-2"-?, 


A’eT, 
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Therefore, 


>) AU/UM4+ IY = DO A. 


A’eTy A’'eT, 


Substituting the results of items (a) and (b) successively in (1.8) and (1.7), 
we arrive at 


dna =dn+ Y> AA) 


ACIn+1 
n+leA 
=dy+(n+1)-27'+ So A(A)+ D> A) 
OAA'EP, A’eT,, 
=dh+(n41)-27!'+ Yo AA) 
DAA'CIn 
= 8d, 4+ G41). 
Now, since the sequence n +> dost = n-+1 is an arithmetic progression, 
it follows that 
dn43 — 2dn+2 dn+1—2dn _, An42 — 2dn+1 
gn+l Qn-1 _——. gn+l 


After clearing out denominators, such an equality gives us the linear recurrence 
relation 


dn+3 _ 6dn+2 2 12dn+1 a 8dn = 0. 


Finally, upon solving such a linear recurrence (cf. Problems 6 and 7, 
page 393—see also Example 3.20 of [8]) we get 


6 =6opn42)-0" 


Problems: Sect. 1.3 


1. For eachn €N, let ay be the number of distinct ways of writing n as a sum of 
summands equal to 1, 3 or 4. Prove that ayn44 = an+3 + Gn+1 + dm, for every 
n> 1. 

2. Show that every convex n-gon has exactly mn) diagonals. 
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Fig. 1.1. The tower of Hanoi 
game 


3. A2 xn checkerboard must be tiled with dominoes (each of which is supposed 
to have the shape of a | x 2 rectangle). If a, denotes the number of such distinct 
tilings, do the following items: 


(a) Prove that ax42 = axr+1 + ag, for every integer k > 1. 
(b) Compute a, as a function of n. 


4. Letn > 1 distinct circles be given in the plane, such that any two of them have 
a common chord and no three of them pass through a single point. Compute, as 
a function of n, the number of regions into which the plane gets divided by the 
circles. 

5. * We are givenn € N anda set A with n elements. 


(a) Let a, and by, respectively denote the totals of subsets of A with even and 
odd numbers of elements. Use a recursive argument to show that a, = 
Gn—1 + bn-1 = bn. 

(b) Conclude that A has exactly 2"—! subsets with an even number of elements 
and 2”—! subsets with an odd number of elements. 


The coming problem is a particular case of Kaplansky’s first lemma, which 
will be dealt with, in general form, in Example 1.28. 

6. Three nonconsecutive chairs are to be chosen out of a row of ten chairs. How 
many are the ways of doing this? 

7. The Tower of Hanoi game consists of three parallel equal rods, vertically 
attached to the surface of a table, together with a pile of n disks of pairwise 
unequal sizes, all having a hole in the center through which they can slide along 
the rods. Figure 1.1 shows the initial configuration of the game for n = 5, with 
all of the disks piled at the leftmost rod, from the smallest disk at the top to the 
largest at the bottom of the pile. The purpose of the game is to move all disks 
to the rightmost rod, possibly with the help of the central rod and subjected to 
the following rule: at no moment a disk can be placed above a smaller one, in 
any rod. For general n, let a, denote the last number of movements needed to 
finish the game. Show that: 


(a) ag41 = 2ax + 1, for every k EN. 
(b) a, = 2” — 1, for everyn EN. 


8. * Given k,n € N, the Stirling number of second kind? § (n, k) is defined 
as the number of ways of partitioning a set of n elements into k nonempty, 
pairwise disjoint subsets. For example, S(3, 2) = 3, since 


After James Stirling, Scottish mathematician of the eighteenth century. With respect to Stirling 
numbers of second kind, see also Problem 4, page 57. 


1.3 


10. 


11. 


Recursion 21 
{1,2} U {3}, {1,3} U {2} and {2,3} U {1} 


are the only ways of partitioning {1, 2, 3} into two nonempty disjoint subsets. 
(a) Verify that S(n, 1) = S(n,n) = Land S(n,k) = Oifn <k. 
(b) Prove that, for 1 < k <n, one has the recurrence relation 
Sa+1,k+1)=S(n,k)+(k+1)S(,k+ 1). 
(c) Use items (a) and (b) to compute the number of ways of distributing 7 
distinct coins into three equal 3 boxes, such that no box stays empty. 


The coming problem will be posed two other times along these notes: one in 
Problem 8 and the other in Problem 8, page 92. 


. Compute, in terms of n € N, the number of sequences of n terms, all of which 


equal to 0, 1, 2 or 3 and having an even number of 0’s. 

A flag with n strips consists of n consecutive horizontal strips, each of which is 
colored red, white or blue and such that any two adjacent strips have different 
colors. 


(a) Calculate how many are the distinct flags with n strips. 

(b) If a, denotes the total number of flags with n strips and such that the first 
and last strips have different colors, show that aj41 +d, = 3-2” for every 
n>l. 

(c) Compute a, as a function of n, for every n > 1. 


(Bulgaria) A finite nonempty set A of positive integers is said to be selfish if 
|A| €¢ A. A selfish set A is minimal if A does not contain a selfish set different 
from itself. Do the following items: 


(a) Prove that the number of minimal selfish subsets of J, that do not contain 
n coincides with the number of minimal selfish subsets of [,_1. 

(b) Givenm € Zand X C Z, let X+m denote the set X¥+-m = {x+m; x € X}. 
Show that, for every integer n > 2, the correspondences 


Aw (A\{n})—1 and BR (B+ DU {n} 


are inverse bijections between the family of minimal selfish subsets of J, 
that do not contain n and that of minimal selfish subsets of J,_2. 

(c) Show that J, has exactly F,, minimal selfish subsets of J,, where F;, denotes 
the n-th Fibonacci number. 


For the next two problems, given a nonempty finite set A of real numbers, we 
let o (A) and z(A) respectively denote the sum and the product of the elements 
of A, with the convention that 0(A) = 2(A) = aif A = {a}. 
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13. 


14. 
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Given a nonempty finite set X C N, let ax denote the sum 
1 
ry SS 
OZACX m(A) 
Do the following items: 


(a) Ifm € N\ X and Y = X U (m}, show that ay = (att) aged, 
(b) Givenn €N, write a, to denote a;,. Show that 


n+2 ie 1 
a = | ——_la —.. 
me n+1 , n+1 


(c) Show that a, =n for everyn € N. 
(USA—adapted) For a nonempty finite set X CN, let by denote the sum 


o(A) 


1 
ODAACX 


(a) Ifm €N\ X and Y = X U {m}, show that 


m+oa(A m+1 
by=bxt Yo MPP (EE Voy tat, 
~mm(A) — m 
OAACX 


where ay is the sum defined in the previous problem. 
(b) Givenn €N, write b, instead of b;,. Show that 


b = b + (n + 1) 
n+1 1 n . 
(c) Deduce that, for ne N, one has 


1 1 


(France—adapted) We say that a set X of natural numbers is good provided 
it satisfies the following property: for every natural number x, if x € X then 


2x ¢ X. For each natural k, let Ay = {1,2,3,..., on and by be the greatest 
number of elements which a good subset of Ax can have. 


(a) Show that by = by_2 + Oe Stor every k > 2. 
(b) For each natural n, compute b, as a function of n. 
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1.4 Arrangements, Combinations and Permutations 


As an application of the ideas of the previous section, we use recursive arguments 
to solve three specific, though very important, counting problems, namely, those of 
counting the number of arrangements without repetitions, of permutations and of 
combinations. For what follows, we recall that J, = {1,2,...,n} for every n € N. 

We start by counting, in the coming result, the number of injective functions 
between two nonempty finite sets. 


Proposition 1.21 Letn,k € N. If A is a set with n elements, then there are exactly 
n(n —1)...(2—k + 1) injective functions f : I, > A. 


Proof Let B be a set with m elements and let az, denote the number of injective 
functions f : I, > B. 

Evidently, aj, = n (since there are exactly n possible choices for f(1) € A) and, 
in this case, the formula in the statement of the proposition is true. 

From now on, assume that k > 1. Fora fixed x € A, if f : I, — A is an injective 
function such that f(k) = x, then the restriction of f to J,_, is an injective function 
f: Ik-1 — A \ {x}. Conversely, given an injective function f: Tk) > A \ {x}, 
we extend Fa to an injective function f : I, — A by letting f(k) = x. 

Since such operations of restriction and extension of injective functions are 
clearly inverses of each other, we conclude that there are as many injective functions 
f : Ik —~ A with f(k) = x as there are injective functions f : Tp-1 > A \ {x}. 
Thus, the number of such functions is exactly agy_1,,—1. However, since there are n 
possible choices for x € A (for, |A| = 1), we get the recurrence relation 


Akn = NAk—-1,n—-1; (1.9) 


which is valid for every natural n > 1. 

Now, observe that ax; = 0 for every k > 1 (for, if |A| = 1, there is no way of 
choosing distinct f(1) and f(2) in A). By induction, assume that ag, = 0 whenever 
k >nand1 <n <™m, wherem > 1 is a natural number. Given k, m € N such that 
k > m, we have k — 1 > m — 1, so that, by induction hypothesis, ag—j,m—1 = 0. 
But then, (1.9) gives aym = max—1,m—1 = 0. Therefore, it follows by induction that 
akn = 0 for every k,n € N such that k > n. Yet in this case we haven—k+1 <0, so 
that n(n — 1)...(2—k+ 1) = 0 and the formula of the statement of the proposition 
is valid. 

Suppose, from now on, that k < n. Then, again from (1.9), we have 


Akn = NAg—~1,n—1 = n(n — 1)ag—2,n-2 


=n(n—1)...(n—k+2)ay n—K+1 
=n(n—1)...n-—k+2)n—k+1). 
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Since an injective function f : I, — A is simply a sequence of k pairwise 
distinct terms, all chosen from the elements of A, we can rephrase the above 
proposition according to the following 


Corollary 1.22 [f|A| =n, then there are exactly n(n—1)...(n—k+1) sequences 
formed by k pairwise distinct elements of A. 


We can apply this corollary to a given combinatorial situation whenever it asks us 
to count how many are the ordered choices of k pairwise distinct elements of a given 
set A. For this reason, from now on we shall say that the previous corollary counts 
the number of arrangements without repetition of k pairwise distinct elements, 
chosen from a given set of n elements. 


Example 1.23 How many are the natural numbers of three pairwise distinct 
algarisms? 


Solution To choose a natural number with three pairwise distinct algarisms is the 
same as to choose a sequence (a,b,c) such that a,b,c € {0,1,2,...,9} are 
pairwise distinct and a ¥ 0. Corollary 1.4 assures that, in order to count the number 
of possible such sequences, it suffices to count the number of sequences (a, b, c), 
with pairwise distinct a,b,c € {0,1,2,..., 9}, and then to discount the number of 
such sequences of the form (0, D, c). 

By the previous corollary, the number of sequences (a,b,c) with a,b,c € 


{0, 1, 2,..., 9} pairwise distinct but without the restriction a 4 0is 10x 9x 8= 
720. On the other hand, the number of such sequences of the form (0, b,c) is 
9 x 8 = 72. Hence, the desired result is 720 — 72 = 648. oO 


The counting of arrangements without repetition has a very important conse- 
quence, based on the following 


Definition 1.24 A permutation of the elements of a nonempty set A is a bijection 
f: Ava. 


If A is finite and nonempty, it is not difficult to prove (see Problem 4, page 7) 
that a function f : A — A is bijective if and only if it is injective. Hence, letting 
k =n in the formula of Proposition 1.21, we immediately get our next result. 


Corollary 1.25 /f |A| = n, then there are exactly n! permutations of the elements 
of A. 


Given a finite set A with n elements and a natural number k such that 0 < k <n, 
the coming result uses a recursive argument to compute the number of subsets of A 
with k elements each. In order to properly state it, recall (cf. Section 4.2 of [8]) that, 
for nonnegative integers n and k with O < k < n, one defines the binomial number 


(;,) by letting 
4 _ n! 
& ~ kin — bY 
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Moreover, for 1 < k <n such numbers satisfy the recurrence relation 


n\  (n-1 i n—-1 

ky Nk k-1)’ 
which is known as Stifel’s relation and allows us to prove that () € N for every n 
and k as above. 


Proposition 1.26 [f A is a finite set with n elements and0 < k <n, then A 
possesses exactly (7) subsets of k elements. 


Proof If k = 0 there is nothing to do, for J is the only subset of A having 0 elements 
and () = |. Thus, let 1 < k < n and Ci be the number of subsets of A with k 
elements. 

For a fixed x € A, there are two kinds of subsets of A with k elements: those 
which do not contain x and those which do contain x. The former ones are precisely 
the k-element subsets of A \ {x}; since |A \ {x}| = — I, there are exactly C= of 
these subsets of k elements of A. 

On the other hand, if B Cc A has k elements and x € B, then B \ {x} Cc A \ {x} 
has k— 1 elements; conversely, if B’ C A\{x}has k—1 elements, then B’U{x} C A 
has k elements, one of which is x. Since such correspondences are clearly inverses 
of each other, we conclude that there are as many k-element subsets of A containing 
x, as there are k — | element subsets of A \ {x}; thus, there are exactly C er such 
k-element subsets of A. 

Taking these two contributions into account, we obtain for | < k < n the 
recurrence relation 


Cha Ce + CE 


which is identical to Stifel’s relation for the binomial numbers (7). Finally, since 
Cj = 1 = (9) and C? =n = (‘) (for A has exactly n subsets of 1 element each— 
the sets {x}, with x € A), an easy induction gives C? = a) forO<k <n. oO 


In words, the previous proposition computes how many are the unordered choices 
of k distinct elements of a set having n elements; one uses to say that such choices 
are the combinations of n objects, taking k at a time. Also thanks to the former 


proposition, one uses to refer to the binomial number (7) as “n chooses k”. 


Example 1.27 When all diagonals of a certain convex octagon have been drawn, 
one noticed that there were no three of them passing through a single interior point 
of the octagon. How many points in the interior of the octagon are intersection points 
of two of its diagonals? 


Solution Firstly, note that the condition on the diagonals of the octagon guarantees 
that each one of the points of intersection we wish to count is determined by a 
single pair of diagonals. Hence, it suffices to count how many pairs of diagonals of 
the octagon intersect in its interior. To this end, note that each 4-element subset of 
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the set of vertices of the octagon determines exactly two diagonals which intersect 
in the interior of it; conversely, if two diagonals of the octagon do intersect in its 
interior, then the set of their endpoints is a 4-element subset of the set of vertices of 
the octagon. Therefore, the bijective principle assures that there are as many pairs of 
diagonals intersecting in the interior of the octagon as there are 4-element subsets 
of its set of vertices. By Proposition 1.26, then, the number of points of intersection 
we wish to count equals (:) = 70. o 


Our next example brings a beautiful application of combinations, due to the 
American mathematician of the twentieth century Irving Kaplansky and known as 
Kaplansky’s first lemma.° 


Example 1.28 (Kaplansky) Givenn, k € N, withn > 2k — 1, show that the number 


k-element subsets of J, without consecutive elements equals (” us: 


Proof We first note that if A C J, has k elements, no two of which being 
consecutive, then n > 2k — 1, for between any two elements of A there is at least 
one element of /,, not belonging to A. Hence, the assumption n > 2k — | made 
in the statement is natural and gives n —k +1 > k, so that the binomial number 
Co) is well defined. 

Now, recall that the characteristic function of A in J, is a sequence of n terms, 
with exactly k of them being equal to | and the other n — k being equal to 0; 
moreover, our assumption on A assures that any two 1’s in such a sequence are 
nonconsecutive. Since the characteristic function determines A, it suffices to count 
the number of such sequences. 

To what is left to do, let’s first consider a sequence of 2(n — k) + | terms, in 
which the n — k terms with even indices are equal to 0: 


( ’ 0, ? 0, a ele | ’ 0, ). 
—_—_—$—$— 
2(n—k)+1 


If we are to have exactly k terms equal to 1, no two of which being consecutive, it is 
necessary and sufficient to choose, from the set of nm —k + 1 positions corresponding 
to odd indices, the k positions in which we want to put the 1’s (as we previously 
pointed out, this is possible because n — k + 1 > k). According to Proposition 1.26, 
there are precisely (” eas) possible choices for such positions. Finally, once we 
erase the not chosen positions, we are left with the characteristic sequence of a set 
satisfying the conditions of the problem. Conversely, it is immediate that every such 
characteristic sequence can be obtained as above, so that the answer to our problem 


is Co, oO 
Now, we are given natural numbers k and n and nonnegative integers nj, ..., nx 
satisfying n = nj +---+n,. We also have k distinct types of objects, say a1,..., ak, 


There is also a Kaplansky’s second lemma, which will be the object of Problem 12. Kaplansky 
devised these two results in order to solve Lucas’ problem—cf. Problem 10, page 41. 
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and would like to count how many sequences of n terms have exactly n; terms equal 
to a}, ..., mm terms equal to ag. Yet in another way, we say that such a sequence is 
a permutation with repeated elements of n objects, each of which being of one of 
the types a1, ..., a and with n; objects of type a1, ..., mg objects of type ax. Let’s 
now look at an equivalent problem. 

If f : In > {a1,..., a} is a sequence satisfying the conditions of the previous 
paragraph (i.e., with n; terms equal to a), ..., mg terms equal to ax), then, taking 
inverse images, we obtain 


I= FG) Us Fp. 


At the right hand side above, the sets f~!(a jp), 1 < j <k, are pairwise disjoint and 
such that | f~! (aj)| =n, for 1 < j < k; we say that the union at the right hand side 
is the partition of /,, induced by f. Conversely, associated to a partition of J, of 
the form J, = A; U...U Ax, with |A;| =n; for 1 < j < k, we have the sequence 
fi In > {ai,...,ax} such that f(A;) = {a;} for 1 < j < k. Therefore, the 
bijective principle guarantees that a problem equivalent to the one of the previous 
paragraph is that of counting the number of distinct partitions of a set of n elements 
into k pairwise disjoint subsets Aj, ..., Ax, under the restriction that |A;| = 7,..., 
|Ax| = nk. 

This being said, the coming result solves both counting problems described 
above. 


Proposition 1.29 Letn,k ¢ Nandn =n, +---+nx be a partition of n in positive 
integers. If A is a set with n elements and (i . e denotes the number of partitions 
of A into sets Ay, ..., Ax, with |Aj;|=nj; for1 < j <k, then 


n n!} 
= ——__, (1.10) 
N1,...,Nk ni!...ng! 


Proof We shall make induction on k € N. Firstly, the case k = 1 is trivial, for in 
this case nj = n and the only subset of A with n elements is A itself. As induction 
hypothesis, let / > 2 be a natural number and assume that (1.10) is true for] <k <1 


and every partition of n into k positive integers n}, ..., ng. Then, fix a partition 
n=n,+---+n, of n in positive integers. 
The bijective principle assures that, for each subset A; of A, with |A;| = mn), 


there are as many partitions of A as in the statement of the proposition as there are 
partitions 


A\ Aj =A, U...UA/-1, 


of A \ A; into/ — 1 subsets Aj,..., Ay-1, with |Aj;| =n; forl <j </—1. 
Since |A \ A7| = n — n;, the total number of such partitions of A \ Ai is, 


by definition, equal to (Rae ah however, since there are exactly ) ways of 
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choosing the subset A; of A, the fundamental principle of counting gives the 


recurrence relation 
N1,...,N] ny,...,nj-1/ \ny 


n—n| 
dacghpS 


( n ) (n — nj)! (") n! 
Ni,...,N ni!...nj—1! \ny ni!...nj_q!n)! 


Remark 1.30 In the notations of the proposition above, it is worth noting that 


(, 2 ~ (1) 


where the right hand side denotes the usual binomial number. Such and equality 
reflects the fact that choosing a k-element subset out of a set with n elements is the 
same as partitioning such a set into two subsets, one with k elements and the other 
with n — k elements. 


Finally, by applying the induction hypothesis to (, ) , we conclude that 


Problems: Sect. 1.4 


1. Ifn € Nand A isa set with n elements, prove that there are exactly n* — n(n — 
1)...(2 —k + 1) sequences of k terms, all taken out of A but not pairwise 
distinct. 

2. * Do the following items: 


(a) Use acombinatorial argument to prove the formula for binomial expansion: 


(tyy"=)0 (jo (1.11) 


k=0 


(b) Generalize item (a), proving combinatorially the multinomial expansion 


formula: 
n 
(tere = ( eg (1.12) 
N1,...,Nk 
My sists npeZ4 
nyte-+ng=n 
where, for nj,...,mx% € Z4 such thatn = n; +--+ + nx, the number 


10. 


11. 


12. 
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. Givenn, k € N, use the result of Proposition 1.29, together with item (b) of the 


previous problem, to compute the number of ordered partitions of a set A, with 
|A| =n, into k subsets Aj, ..., Ag (cf. Problem 8, page 7). 


. Prove that the Stirling number of second kind S(n, k) (cf. Problem 8, page 20) 


can be computed with the aid of the following formula: 


1 
Sin, = 3 om ), 


1.2, Mk 
NY sees npeN 
ny+e-+ng=n 
. MO—shortlist) Given a permutation (a), a2, ..., Gn) Of In, we say that a; is 


a local maximum if a; is greater than its neighbors (for j = 1, this means that 
a, > a2; for j =n, that a, > a,_1). Compute the number of permutations of 
TI, having exactly one local maximum. 


. (OCM) We are given n > 6 points on a circle T’, and draw all (5) chords joining 


two of them. We further assume that no three of such chords pass through a 
single point of the corresponding open disk D bounded by I’. Compute the 
number of distinct triangles satisfying the two following properties: 


i. their vertices belong to D. 
li. their sides lie on three of the (5) drawn chords. 


. * Use combinations to show that a set with n elements has exactly 2”—! subsets 


with an even number of elements and 2”—! subsets with an odd number of 
elements. 


. Compute, in terms of n € N, the number of sequences of n terms, all of which 


equal to 0, 1, 2 or 3 and having an even number of 0’s. 
a) 


. * Givenn,k €N, show that the equation x; +---+x, = 7 has exactly ( kl 


nonnegative integer solutions. 

* Given n, k € N, show that the equation x; + --- +x, = 7 has exactly Gs) 
positive integer solutions. 

The purpose of this problem is to give another proof of Kaplansky’s first lemma 
(cf. Example 1.28). To this end, do the following items: 


(a) If A = {aj,..., ax} C I, is a set without consecutive elements, let x; = 
ay — 1, Xe41 =n — ag and x; =a; — aj_; —2 for2 < j < k. Show that 
X1,...,Xk41 Solves, in nonnegative integers, equation xj +--+ + xx4) = 
n—2k+1. 

(b) If x1,..., x¢41 is a solution of equation x; +---+ 2x44, =n —2k+ 1 in 
nonnegative integers, show how to use it to geta set A = {aj,..., ax} C I, 


without consecutive elements. 
Obtain Kaplansky’s first lemma from the results of items (a) and (b) and 
from Problem 9. 


(c 


Nee 


* Prove Kaplansky’s second lemma: given n, k € N, withn > 2k, the number 
of k-element subsets of [,, without consecutive elements, and considering | and 
n to be consecutive, is equal to 
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14. 


15. 
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n (n—-k 
n—k ( k ) 

For the coming problem, we define a multiset as an ordered pair (A, f), 
where A is a set and f is a function from A into N. If x € A, we say that x is 
an element of multiset (A, f); in such a case, f(x) € N is the frequency of x as 
an element of (A, f). A multiset (A, f) is finite if A is a finite set. In this case, 


the number of elements of the multiset is defined as the sum of the frequencies 
of the elements of A, seen as elements of (A, f); in symbols, 


(A, Al = 2 FQ). (1.13) 


xeA 


Informally, we denote a multiset (A, f) simply by Ay (or even A, whenever 
f is understood and there is no danger of confusion with the set A itself); we 
declare Ar between double curly braces, instead of simple ones (which are 
reserved for declaring sets), with each x € A being repeated exactly f(x) times; 
for instance, if A = {a,b,c} and f : A > Nis given by f(a) = 2, f(b) = 3 
and f(c) = 1, we declare the multiset A ¢ by writing 


Afr = {{a, a, b, b, b, c}}. 


Let n, k € N be given, with n > k, and let A be a set with k elements. Compute 
the number of multisets (A, f) of n elements. 

For each integer n > 1, prove that there are exactly ( 
equal quantities of even and odd elements. 

We are given natural numbers m and n, with m > n. If 


2n 
n 


) subsets of Jp, with 


F = {(Aj,..., An); Aj,...,An C In}, 


compute, in terms of m and n, the value of the sum 


So. Ase, UiARl 


To each permutation o = (a), d2,..., dp) of In, let 


So = (a1 — a2)? + (@2 — 43)? ++ + Gn-1 — Gn)’. 


1 n+1 
nl s, = ( 3 ), 
oO 


where, in the left hand side above, o varies over all n! permutations of [,,. 


Show that 
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17. 


18. 


19. 


(Saint Petersburg—adapted) We are give 2n + 1 points on a circle, such that 
no two of them are the endpoints of a diameter. Prove that, among all triangles 
having three of the given points as their vertices, at most 


n(n + DQn + 1) 


of them are acute.’ 
* The purpose of this problem is to use the material of this section to present 
another solution to Problem 5, page 13. To this end, do the following items: 


(a) Fixed 1 <i <n, show that is the central element of exactly 


EC CP) Gy) 


sets A € Z,, where the above sum extends to all indices j satisfying 0 < 
J <min{i — 1,n —i}. 
(b) Conclude that 


n 


y> (A) = >> (73)! =(n+1)-2", 


AcTy, i=l 
(IMO shortlist—adapted) Let® n € Nand f : I, > I, be such that f(f(x)) = 
x for every x € In. 


(a) Show that f is a bijection and that, if f has exactly k fixed points, then 
n — k must be even. 


(b) Conversely, given 0 < k < _ n such that n — k is even, show that 
for each choice of elements x1,...,x, © J, and for each partition 
{{a1, bi}, ..., {a7, bi }} of the remaining n — k elements of J, in subsets 
of two elements each, there exists a single f : I, > J, having x1,..., xx 
as its fixed points and satisfying f(a;) = b; for 1 <i <Jand f(f(x)) =x 
for each x € In. 

(c) Conclude that the number of functions f : J, — J, such that f(f(x)) =x 


for each x € I, is given by 


© )-u)- Ee 


O<k<n O<k<n 
n—k even (n—k)/2 n—k even 


7For another proof to this problem, see Example 4.18. 


8For a generalization of this problem, see Problem 8, page 58. 
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* (BMO—adapted)? Let n > 2 be an integer and F be a family of 3-element 
subsets of J,,, such that, if A, B € F and A # B, then |AN B| < 1. Prove that 


ve) 
Flaee" =n): 


(Romania) Let k, m € N be given, with m > k, and let A be a set of m elements. 
Choose pairwise distinct subsets A;, A2,..., An of A, such that |A;| > k for 
1 <i <nand|AjNAj| <k—1forl <i < j <n. Prove thatn < () 

Let A = {aj, a2,..., an}, and let P|, Po,..., Py, be pairwise distinct 2-element 
subsets of A, such that 


POP Ae S34 2k on Py = (ai. 0;). 


Prove that each element of A belongs to exactly two of the sets P|, P2,..., Pn. 


°For the other half of the original problem, see Example 2.29. 


Chapter 2 ®) 
More Counting Techniques sei 


In this chapter we study a few more elaborate counting techniques. We start by 
discussing the inclusion-exclusion principle, which, roughly speaking, is a formula 
for counting the number of elements of a finite union of finite sets. The presentation 
continues with the notion of double counting for, counting a certain number of 
configurations in two distinct ways, to infer some hidden result. Then, a brief 
discussion of equivalence relations and their role in counting problems follows. 
Among other interesting results, we illustrate it by proving a famous theorem of 
B. Bollobas, on extremal set theory. The chapter ends with a glimpse on the use of 
the language of metric spaces in certain specific counting problems. 


2.1 The Inclusion-Exclusion Principle 


In this section we extend the formula of Proposition 1.5 for the number of elements 
of the union of two finite sets. The following result is known as the inclusion- 
exclusion principle, and is usually attributed to the French mathematician of the 
eighteenth century Abraham de Moivre. 


Theorem 2.1 (de Moivre) Jf A,, Ao, ..., An are finite sets, then 


n 
|Ay UA2U...U An] = ven! 2 |Ai, MN... Ail, (2.1) 
k=1 Ip <:++<ig 
with the last sum above extending over all sequences (i, i2, ..., ix) of integers such 


that1 <i) <i2 <---<ip <n. 


Proof Fix an x € A; U A2U...U A, and suppose that x belongs to exactly / of 
the sets Aj, Az, ..., An. We shall show that the expression at the right hand side 
of (2.1) counts x exactly once. 
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Note that, for a fixed 1 < k <n, x is counted in the sum 


So [A O... Ai | 


1] <:++<ig 


as many times as the number of sequences (7), ..., ix) such that 1 < ij <--- < 
ix <nandx € Ai, N...N Az. 

Since x belongs to exactly / of the given sets, we can restrict ourselves to the case 
in which k < /; indeed, if k > J, then x will not belong to at least one of the sets 
Ai; see tAtis so that x ¢ Ai, N...N Aix. 

For k < /, the number of sequences (i), ..., ix) as above equals the number of 
ways of choosing k of the / sets containing x. Therefore, there are exactly (,) such 
sequences, and we conclude that x is counted at the right hand side of (2.1) exactly 


l 


So0() 


k=1 


times. However, with the aid of the binomial expansion we get 


; 1 I : 1 
revi()) = (0) - rie-'(;) =1-(1-1) =1, 
k=0 


k=1 
as wished. oO 


Example 2.2 Letn > 3 be anatural number. Show that there does not exist sets A, 
A2,..., An, With n elements each and satisfying the following conditions: 


(a) Any two of the A;’s have exactly two elements in common. 
(b) Any three of the A;’s have no elements in common. 


Solution By the sake of contradiction, assume that there does exist sets A}, A2,..., 
Ay Satisfying the stated conditions. If we show that |Aj U Az U...U Ay| = 1, we 
would have Aj C Ay UA2U...UA, and |A;| = |A; UUA2U...UA,| for] <i <n, 
so that Aj = A; UA2U...U A, for 1 < i <n. In particular, this would give 
A, = A2 =--- = Aj, and item (b) will give us a contradiction. 

To what is left, let’s compute the number of elements of Ay UA2U...U Ay, with 
the aid of the inclusion-exclusion principle: 


n 
JA1U...U An] = )0(-DAT SO [Aa 9... Ag 
k=1 


ip <-++<ig 


= 0 14ial -— >> An 9 Ad, 
iy 


i) <ig 
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where, in the last equality above, we used the fact that Aj, N Ai, N Ai, = @ for 
1 <i <i2 < i3 <n. Now, note that 


n n 
2 
dil4al = oA = Donan 
iy i=1 i=1 


and 


y- 14; A_l = >> 2=2(5) =n —1), 


1, <i2 i <i2 
for, there are exactly (5) = mind) possible choices of indices i; and iz satisfying 


1 <i) < iz <n. Hence, Aj UA2U...U Ay has exactly n2 — n(n -lb=n 
elements. oO 


Another proof of the inclusion-exclusion principle can be given by using 
characteristic functions of sets (cf. Definition 1.12). To this end, we shall need two 
preliminary results on such functions. 


Lemma 2.3 Let A be a nonempty set. If B and C are subsets of A, then: 
(a) XB =XC@ B=C. 
(b) XBnc = xXBXxc-. 


(c) Xp =1— xe. 
(d) XBuc = XB + Xc — XBXc- 


Proof Exercise (see Problem 1). oO 


Proposition 2.4 Let A be a finite set and A\, A2, ..., An be subsets of A. If x, xj : 
A — {0, 1} respectively denote the characteristic functions of Ay U...U An and 
Aj; in A, then 


n 


x(x) =1-[ [d-x)@)) (2.2) 


j=l 
for everyx € A. 
Proof For x € A, we have 
x(x)=H=1 Saxe ALU...UA, 
edlisjsnxeA;j 
edisjsn; xja=l 


# [ [a -xj@)) =0 


j=l 
&1-]]d-xj@)=1 
j=) 


Therefore, both sides of (2.2) coincide at every x € A, as wished. oO 
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Now, observe that if B C A is finite, then 


|B] = >> x(x). (2.3) 


xeA 


On the other hand, in the notations of the former proposition, we have 


x(x) =1-] Jd —- x;@) 


j=l 


bt Se a) ae) 


1<k<n ij <:+-<ix 


ye ko). 


1<k<n i <+:<ig 


where xj,...;, denotes the characteristic function of Aj, 1... Aj, and we used item 
(b) of Lemma 2.3 in the last equality. Hence, the above and (2.3) give 


|A,U...U Ag => x@) 


xeA 


YY YS Col xe 


xEA 1<k<n i <-+- <i 


ys »- eo ae 


1<k<n i, <:+-<ix xEA 


YY Cv din... Ail. 


1<k<n i, <:+-<ix 


We now present two classical applications of the inclusion-exclusion principle. 
For the first one, we say that a permutation (a1, a2, ..., d,) of I, is a derangement 
if aj # i for 1 <i <n. This is the same as saying that the function f : I, > In, 
such that f(@j) = a; for 1 <i <n, has no fixed points. The coming example 
computes how many are these permutations of J, (in this respect, see also Problem 6, 
page 79). 


Example 2.5 Given an integer n > 1, there are precisely 


=n > neh (2.4) 


derangements of J. 
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Proof Let D, be the set of derangements of J, and, for 1 < i < n, let C; be the set 
of those permutations (a1, ..., d,) of I, for which a; = i. Letting P;,, denote the set 
of all permutations of [,,, we clearly have 


Pn \ Dn =C,UC2U...UCy. 
Hence, the inclusion-exclusion principle gives 


Py) Dal = (ep... AGI 


n 
= (2.5) 
= xe : > IC, N...NCy|- 
k=1 ip <- <i 
Since d, = |D,|, we have 
[Pn \ Dal = \Pul — |Pnl = 1! —- dh. 
On the other hand, since 
(a1,---,4n) EC, N...NC, Say =H1,..., ay = ik, 

in order to count how many permutations (a), ..., dn) of In belong to Cj, N.. .NCi,, 
it suffices to count how many are the permutations of J, \ {i1, ..., ix}. Since this is 


a set of n — k elements, it follows that 
ICi, N.C, | = [Pax] = (n — k)!. 


Substituting both these equalities in (2.5), we get 


n! — dn =p > (n —k)! 
k=1 


1] <0+<ig 
=> Ct (i) (n—b)! 
k=1 k 
n (-1)*"! 
es Ss ko 
k=1 


where, in the next to last equality, we used the fact that there are exactly (7) ways of 
choosing integers | <i; <--- < ix <n. Therefore, 


n k-1 
—1 
dy =n!—n!¥ a 
k=1 . k= 


Oo 
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Example 2.6 We are given m,n € N and finite sets A and B, such that |A| = m 
and |B| = n. Prove that the number of surjective functions f : A > B equals 


de-v'(7) (n—k)". 
k=0 k 


Proof Let Fm,» denote the set of functions f : A > B, and Sj, the subset of Finn 
consisting of the surjective ones. Also, let B = {bj,..., b,} and, for 1 <i <n, Ff; 
be the set of functions f : A — B such that b; ¢ Im(f). Then, 

Finn \ Sman = F1 U...U Fh, 


and the inclusion-exclusion principle gives 


\Finsn \ Smn| =, |Fy U . UFy| 


n 
=) E1)? > \Fi, 1... Fy. 
k=1 


1] <-++<ig 


(2.6) 


Letting Sinn = |Sm.n|, we have 
|Fin,n \ Sin,nl = |Finnl -— ISimn| =n” — Sm,ns 


where in the last equality we used the discussion in the last paragraph of Sect. 1.1. 
On the other hand, for a given function f : A — B, we obviously have 


f EFI ON... OF, & bi, ..-, di, ¢ Im(f); 
in turn, this is equivalent to the fact that f can be seen as a function from A into 
B \ {bi,,.-., bi, }. However, since |B \ {bj,,..., bi,}| =n — k (and invoking once 
more the discussion in the last paragraph of Sect. 1.1), we get 
IF, 126-0 Figl = |Fimn—kl = (2 — ky”. 
Substituting these relations in (2.6), we obtain 


n” —Smn = YDS! YT ky" 
k=1 


ip <--+<ig 


= cori) (n—k)”, 
dX ; 
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where, in the last equality, we used the fact that there are () ways of choosing 
integers 1 <i} <--- < ig <n. Hence, 


m “ an n m “ n m 
an “ey (je - 6 =e (Ha-% 


k=0 
| 


We shall reobtain the results of the two examples above, by other means, in 
Chap. 3 (cf. Problem 7, page 91, and Problem 9, page 92). 


Example 2.7 (Romania) Compute the number of ways of coloring the vertices of 
a regular dodecagon using two colors, in such a way that no monochromatic set of 
vertices is the set of vertices of a regular polygon. 


Solution A regular polygon having its vertices among those of the dodecagon is 
either an equilateral triangle, a square, a regular hexagon or the regular dodecagon 
itself. Since alternating vertices of a regular hexagon form an equilateral triangle, 
we conclude that it suffices to avoid equilateral triangles and squares. Let red (R) 
and blue (B) be the used colors. 

There are four equilateral triangles whose vertices are among those of the 
given regular dodecagon, and such equilateral triangles have pairwise disjoint sets 
of vertices. Since each one of them can be colored in exactly six distinct and 
non monochromatic ways, the number of distinct colorings of the vertices of the 
dodecagon without monochromatic equilateral triangles equals 6 = 1296. 

Let’s calculate how many of these 1296 colorings contain at least one monochro- 
matic square. To this end assume, without loss of generality, that such a coloring 
contains a red square. Each one of its vertices is a vertex of exactly one of the 
four possible equilateral triangles. Since such a vertex is red, the other two vertices 
of the corresponding equilateral triangle could be colored BB, BR or RB. By the 
fundamental principle of counting, exactly 2 - 3 - 3+ = 486 (2 colors, 3 squares and 
3 possible colorings for the other two vertices of each one of the four equilateral 
triangles) of the 1296 colorings without monochromatic equilateral triangles contain 
a monochromatic square. 

Let’s now compute how many of the 1296 colorings without monochromatic 
equilateral triangles contain at least two monochromatic squares. We do this 
considering two separate cases, according to whether the two monochromatic 
squares have equal or different colors. For each equilateral triangle, exactly one 
of its vertices is not a vertex of any of the two monochromatic squares. Assuming 
(without loss of generality) them to be red, we conclude that there is exactly one 
possible color (blue) for the third vertex. Assuming them to be one red and the other 
one blue, we conclude that there are two possible colors (either red or blue) for the 
third vertex. Since there are () = 3 ways of choosing two of the three squares, the 
number of colorings with at least two monochromatic squares equals 
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(2) -@ 1842-24) = 102 


(note that if both squares have the same color, then there are two possibilities: 
they are both blue or both red; if they have different colors, there are also two 
possibilities: BR ou RB). 

Finally, let’s calculate how many of the 1296 colorings without monochro- 
matic equilateral triangles have three monochromatic squares. The nonexistence of 
monochromatic equilateral triangles forbids the three squares to have a single color. 
Hence, there are six possible distinct colorings for their sets of vertices: BBR, BRB, 
RBB, RRB, RBR and BRR. 

Therefore, by the inclusion-exclusion principle, the number of colorings we wish 
to count is 


1296 — 486 + 102 — 6 = 906. 


Problems: Sect. 2.1 


1. * Prove Lemma 2.3. 
2. How many natural numbers from 1 to 1000 have a prime factor less than 10? 
3. * Given m,n € N, with m <n, prove that 


n-1 


(—1! (7) (n—k)™ =0. 
Lo'"”E 


4. Given n € N, compute the number of ways of forming a line with n couples, in 
such a way that no wife is a neighbor of her husband and vice-versa. 


5. (England) How many are the permutations (a1, a2, ..., a6) of J6 such that, for 
1 < j <5, the sequence (a, ..., a@;) is not a permutation of /;? Explain your 
answer. 


The next two problems admit from the reader a certain degree of acquain- 
tance with the decomposition of a natural number n > | as a product of powers 
of distinct primes, as well as with the notion of greatest common divisor of 
two nonzero integers. Specifically for Problem 7, the reader may find it useful 
to recall the following fact: ifn > 1 is natural but not prime, then n admits 
a prime divisor which is less than or equal to ./n. The necessary material is 
collected in Chap. 6. Alternatively, the reader can safely skip these problems 
without loss of continuity. 

6. Letn > 1 be anatural number and n = Bi sishe pe its factorisation as a product 
of powers of distinct primes p; < --- < px. Prove that there are exactly 


2.1 


10. 
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1 1 
n(1-—)---(1-=) (2.7) 
PI Pk 


integers 1 < m <n such that gcd(m, n) = 1. 
For the next problem, we let |x| stand for the integer part of x € R, i.e., the 
greatest integer which is less than or equal to x. Thus, we have (for instance) 


[1| = 1, | V2] = 1 and [| = 3. 


. Letn > | be natural and p; < p2 <--- < px be the prime numbers less than 


or equal to ./n. Prove that the quantity of prime numbers less than or equal to 
n is given by the formula 


ap ae ner 


I Si, <-+<ij<k J 


Forn €N, we let p(n) denote the number of distinct partitions of n in naturals 
summands. Also, for 1 < k < n we write px(n) to denote the number of 
partitions of 1 into exactly k distinct natural summands. Prove that 


pay= Yo DET SY aOp—D. 


1<k<J/2n i <l<n 


(Mongolia) Given k,m,n € N, let p = min {n, iy}. Show that equation 


xy +x2 +-+-+ 2X, = m has exactly 


: (n\ (m—jk+tD+n-1 
eo) 


j=0 


solutions (x1, X2,...,Xn) with x, € Zand0 < x, <k,forl <t <n. 

Lucas’ problem! aims at counting the number of ways in which n couples 
can sit in 2n chairs around a circle, such that no wife sits beside her husband 
and no two people of the same sex sit on adjacent chairs. The purpose of this 
problem is to present I. Kaplansky’s solution to this problem. To this end, do 
the following items: 


(a) Conclude that there are 2(n!) ways of choosing the chairs of the hus- 
bands. Once we are done with (a), conclude that the number of desired 
configurations is equal to 2(n!)a,, where a, counts the number of ways of 
distributing the wives in the n left chairs, in such a way that no wife sits 
beside her husband. 


' After Edouard Lucas, French mathematician of the nineteenth century. 


42 


2 More Counting Techniques 


(b) Label the chairs of the husbands, counterclockwisely, as hj, h2, ..., hn, 


(c 


(d 


(e 


wa 


) 


wma 


and those to be occupied by the wives (in some allowed order) as 1, 2,..., 
n, with chair 1 situated between fi; and ho, chair 2 situated between hz and 
h3, and so on. For 1 <i <n, let w; denote the wife of h;, and A2;_1 (resp. 
A2;) denote the set of ways of distributing the wives such that w; sits in 
chair i — | (resp. chair i. Here, chair 0 is the same as chair n). Conclude 
that 


ay, =n! — |A;y UU A2U A3U AGU... U Arn_1 U Aan|. 
If {i1, i2,..., in} C {1,2,..., 2n} has two consecutive elements (with 2n 
and 1 seen as consecutive), show that Aj, 1 Aj,N...N Ax = Y. 
If {ip < in <... < ig} C {1,2,..., 2n} has no consecutive elements (also 
with 2n and | seen as consecutive), show that 


|A;, NAD N...N AZ| = —DI. 


Apply Kaplansky’s second lemma (cf. Problem 12, page 29) to conclude 


that there are exactly 
2n (2n—k 
2n—k k 


ways of composing an intersection of the form Aj, M Aj, 1... Ai,, with 
{ij <i2 <... < ix} C {1,2,..., 2m} having no consecutive elements (and 
2n and 1 seen as consecutive). 


(f) Conclude that 


wy yk 20 (In—k\ | 
m= DK 1) =a : Jo k)!. 


2.2 Double Counting 


Double counting is nothing but the bijective principle in action once again, albeit 
under a different perspective. The central idea is that, if we count a certain number 
of configurations in two different ways and without making mistakes, then the 
two results thus obtained must coincide. Sometimes this naive approach generates 
interesting conclusions. 


The best way to master double counting is by looking at some examples, and we 


start with one that allows us to reobtain an important result, already derived by other 
means. 
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Example 2.8 Vf n and k are integers such that 0 < k < n, then every set with n 
elements has exactly (4) subsets with k elements. 


Proof Note first that it suffices to consider the case 1 <k <n. 

Under this assumption, fix a set A with n elements and consider the problem of 
counting how many sequences of k terms, chosen from the elements of A, can be 
formed. By Corollary 1.22, the answer isn(n — 1)...(21—k +1). 

On the other hand, by permuting the elements of each k-element subset of A we 
generate k! different sequences of k terms; moreover, it is immediate to see that the 
sequences we get this way are all sequences of k terms chosen from A, and they 
are all distinct. Hence, if (as in the previous chapter) C? denotes the number of k- 
element subsets of A, we conclude that there are exactly k!C/; sequences of k terms, 
all chosen from the elements of A. 

Now, since n(n — 1)...(n —k + 1) and k!C? are answers to the same counting 
problem, we conclude that 


KIC? =n(n—-1)...2—-—k4+1) 
or, which is the same, 


n(n—1)...n—k+1) 
k! 
n(n—1)...1—k+1)-(n—k)! 


k(n — k)! 
= n 
= (i): 


The result of the next example is known as Lagrange’s identity, after the French 
mathematician of the eighteenth century Joseph L. Lagrange. 


CE = 


Oo 


Example 2.9 Prove that, for every natural number 7, one has 


2n n\- n\* n\" n\- 
SOG eee) 
Proof Consider a group of 2n people, composed by n men and n women. We shall 
compute the number of ways of choosing exactly n of the 2n people, by using two 
different approaches. 
On the one hand, the answer obviously is (7), On the other, for 0 < k <n, the 
fundamental principle of counting guarantees that we can choose k men and n — k 


women in exactly ‘a (, is ) different ways; adding such possibilities as k varies from 
0 to n and using the fact that (71) = (,,",), we get the sum 
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(ha) 


as another answer to the posed problem. Hence, the stated equality actually take 
place. Oo 


Yet concerning the previous example, a nontrivial difficulty in the implementa- 
tion of the double counting argument relied in looking at both sides of the desired 
equality as different ways of counting the number of configurations relative to a 
single combinatorial situation. There is no general rule on how to find out such a 
counting problem; just experience and careful thinking will help. Therefore, let’s go 
on and discuss another (and tougher) example. 


Example 2.10 Let m and n be positive integers. Prove that 


EG) EG) Cm) 


J 


with the above sums ranging over all possible integer values of the indices j and k. 


Proof Let A and B be disjoint sets, with |A| = m and |B| = n. Let’s calculate the 
number of ordered pairs (X, Y), with X C A and Y Cc BUX such that |Y| =m. 


If |X| = k, there are (7) ways of choosing X; for each of them, the disjointness 


of A and B gives |B U X| = n +k, so that there are (eS) ways of choosing Y. 
Therefore, by successively applying the multiplicative and additive principles, we 
conclude that the number of such ordered pairs is 


m\ (n+k 
E(t) Cn) 

We now choose Y first. More precisely, we compute how many ordered pairs 
(X, Y) satisfy the given conditions and are such that |Y M B| = j. There are () 
ways of choosing j elements from B to put in Y; once they are chosen, we are left 
to choosing m — j elements for Y, which ought to be elements of A. Then, there 
are (3) = ("") ways of choosing these m — j elements of A, and the condition 
Y c BUX forces such elements to belong to X. Apart from these m — j elements, 
the remaining elements of X can form an arbitrary subset of the set formed by 
the remaining j elements of A, so that we can choose such a subset in exactly 2/ 
different ways. Hence, the number of pairs (X, Y) is given by 


EGG) 


J 
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Finally, since both of the above countings solve the same problem, the only 
possibility is to have the stated equality. Oo 


The coming example illustrates how one can use double counting arguments to 
get inequalities. 


Example 2.11 (IMO) Let n and k be positive integers and S be a set of n points in 
the plane satisfying the following conditions: 


(a) no three points of S are collinear. 
(b) For every point P € S, there are at least other k points of S equidistant from P. 


Prove that k < : +/2n. 
Proof Let 


F ={(A, B,C); A, B,C € S and AB = AC}. 
We shall count || in two different ways. 

Firstly, for each A € S there are at least k points of S equidistant from A and 
such points generate at least k(k — 1) pairs (B, C) such that (A, B, C) € F. Since 
there are n possible choices for A € S, we conclude that 

|F| > nk(k — 1). 

Now, for fixed B,C € S, we claim that there are at most two points A in S 
such that (A, B,C) € F. Indeed, since any such A belongs to the perpendicular 
bisector of BC, and condition (a) gives at most two of them. However, since there 
are n(n — 1) possible choices for (B, C), we conclude that 

|F| < 2n(n — 1). 

Thus, 

nk(k — 1) < 2n(n — 1) 


and, hence, 


k 


1+J8&-7 14-8 1 
< = = < = == 5+ 2n, 


oO 


We finish this section by showing how to use a double counting argument to 
obtain formula (2.4) for the number of derangements of J,,. Prior to that, however, 
we need to establish the following auxiliary result, which will also be useful in 
Sect. 18.1. 
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Lemma 2.12 If ao, aj,..., Gn and bo, bj, ..., bn are real numbers such that a, = 
ye 4 (‘bj for0 <k <n, then 


k 


be = (-D (5)« j (2.8) 


j=0 
for0O<k<n. 


Proof Let’s make induction on 0 < k < n, noting that the result is obvious if 
k = 0. As induction hypothesis, assume (2.8) to be true when k = / — 1, for some 
1 <1 <n. Fork =1, we get 


I I-1 


0-2 (E(u 


j=0 j=0 
so that, by applying the induction hypothesis and the result of Problem 1, we obtain 
I-1 


l 
by =a (oi 
j=o J 


I-1 


20RO: 
=o Bo“) 


j=0 i=0 


I-1 1-1 


-4 EE 0-()() 


i=0 j=i 


If we now observe that 


the above computations give 


I-1 1-1 


2 EE) 


i=0 j=i 


b 


v- Fou Sn(-) 
a xe 1) (ade Dae (CF '). 
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Finally, since 


l-i ° [-i 


ve )=Pew (7 ')-1-0-pe-1=-1, 
t=0 


t=1 


we get 


i-1 /I I fd 
by = a+ > (-1)* (Ja = ea @z 


i=0 
oO 
We present the promised example next. 
Example 2.13 If dy, stands for the number of derangements of /,,, then 
ae 
ki > ( ay (2.9) 
j=o 


k (bs 


In particular, dy = k! )) jo iI 


Proof As in Example 2.5, let P, denote the set of all permutations of J; and let’s 
count |P;| in two different ways. 

Firstly, Corollary 1.25 gives |Px| = k!. 

Secondly, forO0 < j < k, let F; denote the subset of P, formed by those 
permutations of J; having exactly j fixed points. Then, 


Pr=FoUF,U...UF; 
however, since the union at the right hand side above is disjoint, we obtain 
[Pel = |Fol + [Fil +--+ + Fel. (2.10) 
Now, for f € F;, we can choose its j fixed points in exactly (5) ways; on the 


other hand, the restriction of f to the subset of J, formed by the other k — j points 
clearly forms a derangement of J;_ ;. Hence, the fundamental principle of counting 


gives 
k k 
Fil=(_)ai= (yo Jae 
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It then follows from (2.10) that 


k k 


Fev (2; )aei => (Jar 


j=0 J=0 j=0 


and it suffices to compare both expressions above for |P;| to get (2.9). 
For what is left, we apply the previous lemma to (2.9) (with a, = k! and by = dx) 
to get 


ee | k! 
d = —1 +i ) i! — —] k+j : 
P 2 ei 2 a a, 
k _yyk- k 4) 
=n OP au ( 
sap F-D) jo J 


Problems: Sect. 2.2 


1. * We are given n € N and, for 0 <i, j <n, areal number a;;. Show that 


n oj non 
VY =i ean! 
4 4 aij = 4 4 aij. 


j=0 i=0 i=0 j=i 


2. Use double counting to prove that, for each natural number 7, one has 
+ " + " + + Pies a 
0 1 2 np 
3. Prove Vandermonde’s identity~: given m,n, p € N, one has 
3 m n _ (m +n 
—\k/\p-k} Xp 


with the sum at the left hand side above ranging through all possible nonnega- 
tive integer values of k. 


? After Alexandre-Thedphile Vandermonde, French mathematician of the eighteenth century. For 
another proof of Vandermonde’s identity, see Problem 3, page 355. 


2.3 


10. 
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For each integer n > 1, prove that the number of subsets of Jo, with equal 


quantities of odd and even elements is equal to Cy. 


. A battalion commander asked volunteers to compose 11 patrols, all of which 


formed by the same number of men. If each man entered exactly two patrols 
and each two patrols had exactly one man in common, compute the numbers of 
volunteers and of members of each patrol.* 

(IMO) Given n,k € N, let p,(k) denote the number of permutations of J, 
having exactly k fixed points. Prove that }°7_9 kpn(k) = nl. 


. (TT) The set of natural numbers is partitioned into m infinite and nonconstant 


arithmetic progressions, of common ratios d}, d2, ..., dm. Prove that 
: + : + + : = 
d 1 d2 din 


The result of the coming problem is known as Sperner’s lemma, after the 
German mathematician of the twentieth century Emanuel Sperner. 
A triangle ABC is partitioned into a finite number of smaller triangles, in such 
a way that any two of them are either disjoint, have exactly a common vertex or 
exactly a common side. Such a partition generates a certain number of points 
lying in the interior or on the sides of ABC and which are vertices of smaller 
triangles. We call them vertices of the partition. Randomly label the points lying 
on side AB (and different from A or B) as “A” or “B”; accordingly, randomly 
label the points lying on side AC (and different from A or C) as “A” or “C”, 
and those lying on side BC (also different from B or C) as “B” or “C”. Then, 
label each of the remaining vertices of the partition (all lying in the interior 
of ABC) as A, B or C, also randomly. Prove that at least one of the smaller 
triangles will be labelled as ABC. 
(Russia) A senate has 30 senators, any two of which are either friends or 
enemies. It is also known that each senator has exactly six enemies. If each 
three senators form a commission, compute the number of those commissions 
such that its members are either pairwise friends or pairwise enemies. 
We are given a convex n-gon in the plane. Prove that it is possible to choose 
n — 2 points in the plane such that every triangle formed by three vertices of the 
given n-gon has exactly one of the chosen points in its interior. 
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Let A be a fixed nonempty set, and recall that a relation in A is a subset of the 
cartesian product A x A. 


3For another approach to this problem, see Problem 21, page 134. 
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We say that a relation R in A is reflexive if aa for every a € A; symmetric if, 
whenever a, b € A, we have that aRb => bRa; transitive if, whenever a,b,c € A, 
we have that aRb and bRc imply ac. Finally, an equivalence relation in A is 
a relation ® in A which is simultaneously reflexive, symmetric and transitive. In 
Mathematics, equivalence relations are customarily denoted by ~, instead of R (i.e., 
by writing a ~ b instead of ab); from now on, we shall stick to this usage without 
further comments. 


Examples 2.14 


(a) Let F be a partition of a nonempty set A. The relation ~ in A, defined by 
a~bs ABef;a,beB, 


is of equivalence. We say that such an equivalence relation is the one induced 
by the partition F. 

(b) Let A and B be nonempty sets and f : A > B bea given function. Relation ~ 
in A, defined by 


a~bs f@=f) 


is of equivalence. We say that such a relation is induced by /. 
The following definition is central to all that follows. 
Definition 2.15 Let A be a nonempty set and ~ be an equivalence relation in A. 
For a € A, the equivalence class of a is the subset a of A defined by 


a={xeA:x~a}. (2.11) 


If ~ is an equivalence relation in a nonempty set A anda e€ A, then the 
equivalence class of a is clearly nonempty, for a € a. Our first result analyses 
the equivalence classes of a generic equivalence relation. 


Proposition 2.16 Let ~ be an equivalence relation in a nonempty set A. Fora, b € 
A, one has 


a~bsanb46sa=b. 


In particular, the equivalence classes of ~ form a partition of A. 
Proof We shall show thata~b > @Nb#AVD>S>a4=b>aw~b: 


* Ifa~b, thena € band, hence, a € @N Dd; therefore, aN b HO. 

* Ifceanb (ie. ifanb % &), then c ~ a andc ~ D; hence, the symmetry of ~ 
guarantees that a ~ c and c ~ b and, thus, its transitivity implies a ~ b. We shall 
now show that @ C b (the converse inclusion being totally analogous): if x € @, 
then x ~ a; however, since a ~ b, it follows once more from the transitivity of 
~ that x ~ b,ie., x € b. 
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¢ If@=b, then a € @ implies a € b. But this is the same as saying that a ~ b. 
oO 


If ~ is an equivalence relation in a nonempty set A, it follows immediately from 
the previous proposition that the family A/ ~, defined by 


A/~={a@aeA} (2.12) 


(i.e., the family of the equivalence classes of the elements of A with respect to ~), is 
a partition of A. Such a family is called the quotient set of A by ~, and the function 


Takashio (2.13) 
at a 
is the projection function of A on A/ ~. 
Given an equivalence relation ~ in a nonempty set A, a subset B of A is said to 
be a system of distinct representatives (we abbreviate SDR) for ~ if the following 
conditions are satisfied: 


(a) Foralla,b € B, eithera = bora b. 
(b) For every a € A, there exists b € B such that a ~ b. 


In words, a SDR for an equivalence relation ~ in A is a subset of A formed by 
pairwise nonrelated elements such that every element of A is related to exactly one 
of them. Yet in another way, if B is a SDR for the equivalence relation ~ in A, then 


A/ ~= {%; x € Bh. (2.14) 


The following proposition provides an effective way of constructing SDR for 
equivalence relations. 


Proposition 2.17 Let ~ be an equivalence relation in a nonempty set A. If f : 
A/ ~ — Aisa function such that f (a) € a for every a € A, then Im(f) isa SDR 


for~. 

Proof Let B = Imf.Ifa,b € B, saya = f(@) andb = f(B), with a, B € A, 
then the hypothesis on f assures that a € @ and b ¢€ £. Ifa ~ 5, then, since a ~ a 
and b ~ B, we have by transitivity that a ~ 6; thus, aw = 6 and, hence, 


a= f@) = f(p) =b. 


In order to verify the validity of the second condition in the definition of a SDR, 
take a € A and letb = f(a). Then b € Im(f) and the definition of f gives b € a, 
le.,a~ b. oO 


52 2 More Counting Techniques 


In the notations of the previous proposition, we say that f is a choice function 
for the equivalence relation ~. In words, the role of f is to choose one element out 
of each equivalence class in A/ ~. 

Concerning the role of equivalence relations in counting problems, we have 
finally arrived at the result we are interested in. 


Proposition 2.18 Let ~ be an equivalence relation in a finite nonempty set A. If 
all of the equivalence classes of A with respect to ~ have the same number k of 
elements, then there are exactly |A|/k distinct equivalence classes. 


Proof If {a1,..., am} is a SDR with respect to ~, then A = L_; Gj, a disjoint 
union. Hence, 


m 


m 
Al =o lajl = Dok = kn, 
j=l 


j=l 


so that m = |A|/k. oO 


As a first example of application of the previous proposition, we shall now 
reobtain the formula for the number of k-element subsets of a given set A, of n 
elements (the following reasoning is a slight variation of that in Example 2.8, the 
major change being in the adopted viewpoint). To this end, let 1 < k <n and S;(A) 
be the set of sequences of k distinct terms, all belonging to A. We already know that 


n!} 
|Sx(A)| =n(n—1)...m2—-k4+)= aa 
Let ~ be defined in S;(A) by 
(aj,..., aK) ~ (b1,..., be) > {ay,..., ax} = {b1, ..., dx}. (2.15) 


It’s immediate to check that ~ is of equivalence, and such that the equivalence 
class of a sequence (a1, ..., ax) is the set of all of its permutations. Hence, such an 
equivalence class has precisely k! elements, and Proposition 2.18 assures that the 
number of distinct equivalence relations is equal to 


ISk(A)] nh ) 
k! =a (k 


Finally, note that (2.15) allows us to identify the equivalence classes with the k- 
element subsets of A, so that there are exactly (i) of such subsets. 

Another interesting situation is the following: in the set S(A) of the permutations 
of a finite nonempty set A, we define an equivalence relation ~ in the following 


way: 


(Q1,.-+,4n) ~ (Dj,..., bn) 
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t (2.16) 


aj+k, WIS jsn—k 


di<k<n;b;= : : 
eats Ad ane alo 


In other words, (aj,...,dn) ~ (b1,..., by) means that, for some 1 < k <n, one 
has 


(by, « «5 On) = (G4, Ak425 -+ +s An—1, Ans A, «- +5 Ak). 


It’s immediate to verify that ~ is actually of equivalence in S(A). Moreover, 
for each sequence (a,..., dn) € S(A), its equivalence class (a), ..., d,) contains 
exactly n sequences; more precisely, 


(a1, ++ n) = {(@1, 42...,An—1,4n), (a2, 43, ++, n,Q}) 


(43, ..-,4n, 41, 42),..-, (An, 41, a2, .-., An—1)}- 


With respect to the equivalence relation just described, the equivalence class of 
a sequence (dj,...,4@y) 1s said to be the circular permutation of the sequence 
(a1,..-, 4»). In particular, it follows from the general discussion on equivalence 
relations that all of the sequences 


(a, a2.--,4n-1, an), (a2, a3,--+,4n, a), ae (an, a|,Q2,..., an—1) 


define the same circular permutation. 
With the concept of circular permutation at our disposal, we now have the 
following result. 


Proposition 2.19 The number of distinct circular permutations of a set of n 
elements is (n — 1)!. 


Proof If A is a set having n elements, we wish to count the number of equivalence 
classes of S(A) with respect to the equivalence relation just described. Since 


|S(A)| = n! and each equivalence class has exactly n elements, it follows from 
Proposition 2.18 that there are precisely a = (n — 1)! distinct equivalence classes 
(i.e., circular permutations). oO 


For what comes next, we say that two sets A and B are incomparable with 
respect to inclusion’ if A ¢ B and B ¢ A. The result below brings a deeper 
application of the ideas of this section, and is due to the Hungarian mathematician 
Béla Bollobas. Our discussion follows [25]. 


4This terminology comes from the fact that the inclusion relation is a partial order in the family of 
all subsets of a given set. In this respect, see the discussion at Sect. 4.3. 
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Theorem 2.20 (Bollobas) If F is a family of subsets of I,, pairwise incomparable 
with respect to inclusion, then 


=) a 


Proof Let P be the set of permutations (a1, ...,d@,) of J, such that, for some | < 
k <n, we have {aj,..., ax} € F. Being a set of permutations of /,,, it is immediate 
that |P| <n!. 


We consider in P the relation ~, given by 
(a1,...,4n) ~ (b1,..-,0n) @ {a1,..., ax} = {b1, ..., dx} ef, 


for some integer 1 < k < n. We claim that ~ is of equivalence. Indeed, it is clear 
that ~ is reflexive and symmetric. In order to show that it is transitive, take a third 
permutation (cj,...,C,) in P and assume that {a),..., ax} = {b),..., be} € F 
and {b,,..., bj} = {c1,..., c} € F, for some integers 1 < k,/ <n. Without loss 
of generality, we can also assume that k < /. However, if k < /, then 


{a1,..., ax} = {b1,..., de} C {b1,..., bi} = {e1,...,c1} € F, 


a strict inclusion; this would contradict the fact that the sets in F are pairwise 
incomparable with respect to inclusion. Therefore, k = / and, thus, 


{a1,..., ax} = {h1,..., bk} = {e1,..., cK}  F, 


Le., (4],...,n) ~ (C1,..-, Cn). 

We now fix a permutation (a),...,da,) € P, with {a),...,ax} = A € F. Then, 
(b1,...,0n) ~ (a1,.-.,@) if and only if {b1,..., bg} = A, i-e., if and only if 
(bi, ..., bg) is a permutation of the elements of A (and, hence, (by41,..., bn) is a 


permutation of the elements of J, \ A); since we can permute the elements of A and 
of I, \ A in exactly | A|! and (1 — |A]|)! ways, respectively, we conclude that there are 


exactly |A|!(7 — |A|)! permutations (51, ..., b,) which are equivalent to the fixed 
permutation (a1,...,d,) € P. Yet in another way, if A is the set in F associated to 
(a1,...,4n) € P, then the equivalence class of (a1, ...,@,) in P contains exactly 


|A|!(2 — |A|)! permutations. 

Finally, by Proposition 2.16, the quotient set P/ ~ of equivalence classes of 
P with respect to ~ forms a partition of P. However, since distinct equivalence 
classes are associated to distinct sets A € F and every set A € F gives rise to an 
equivalence class, it follows that 


n=|Pl= > #@,-- dn) = D> IAI — | AID. 
(a1 ,-.,dn)EP/~ AcF 
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This is the same as 


+ Jae BIDE ong 


n! 
AcF 


’ 


which in turn is equivalent to the desired inequality. Oo 


For our next example we first need to recall some simple facts concerning the 
notion of divisibility of integers. Further details can be found in Chap. 6. 

Given a,b € Z, with b ¥ 0, we say that b divides a, or that a is a multiple of 
b, if ¢ is an integer. Letting | = c, this is the same as saying that a = bc, with 
c € Z. For instance, 7 divides 28, for 23 — 4 € Z. If b divides a, we write b | a; 
otherwise, we write b { a and say that b does not divide a. If a is not a multiple of 
b, the division algorithm? shows that there exist unique integers q and r such that 
a=bq-+rand0<r <|D|. 

We now introduce a whole class of equivalence relations in Z, which will reveal 
itself to be of paramount importance to the material of Chaps. 10-20. 

For a given integer n > 1, we say that two integers a and b are congruent modulo 
n if n divides a — b. If a and b are congruent modulo n, we write a = b (modn); 
otherwise, we write a € b(modn). For instance, we have 1 = —5 (mod 3), since 
3 | (. — (—5)), but —3 # 7 (mod 4), since 4 + (—3 — 7). Note that ifa = nq +r, 
with 0 <r <n, thenn | (a —1r), so that a =r (modn). Therefore, every integer is 
congruent, modulo n, to precisely one of 0, 1,...,” — 1. 


Example 2.21 Letn > 1 be a given integer. The relation ~, defined in Z by 
a~b#saz=b(modn), 


is an equivalence relation in Z, which is known as the relation of congruence 
modulo n. 


Proof Reflexivity of ~ follows from 
a~a®&a=a(modn) &n | (a-a), 
which is clearly true. On the other hand, since 
a~bsa=b(modn) =n | (a—b), 
andn | (a—b) &n | (b—a), we have that 


a~b>n|(b-a)>n|(b-a) 
=> b=a(modn) >b~a. 


5For a formal definition of an algorithm, see the footnote at page 116. 
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Finally, for the transitivity of ~, if a ~ b and b ~ c, thenn | (a — b) and 
n|(b—c),ie., ap and poe are both integers; hence, the number 


is also an integer, which in turn guarantees that n | (a — c). But this is the same as 
saying that a = c (modn), so thata ~ c. oO 


For the coming example, given an integer m > 1, aset A C J, and another 
integer 0 < r < m — 1, we let A+, r denote the set 


A+rz={a+r(modm); aé A}. 
Thus, 


A+r=({x+r;xeAandl1<x<m-—r}U 


U {x+r—m;xeAandm—r<x<m}. 
For instance, if m = 7, A = {1, 3,5} andr = 4, then 
A+r= {14+4,3+4+4,5 +4 (mod 7)} = {5, 7, 2}. 


Example 2.22 (Austrian-Polish) Let k and n be given natural numbers, such that 
3{kand 1 <k < 3n. Prove that the set J;, has exactly 1(7?) subsets of k elements, 


such that the sum of its elements is a multiple of 3. 


Solution Let F be the family of k-element subsets of /3,. Consider a relation ~ 
in F be defining A ~ B if and only if there exists an integer 0 < j < 2 such 
that B = A+ j modulo 3. It is immediate to check (cf. Problem 2) that ~ is an 
equivalence relation, and that the equivalence class of a subset A of J, contains 
exactly the three sets A, A+ 1 and A + 2 (these last two modulo 3). 

On the other hand, letting S = )°..4 x, we obviously have 


S> x =S+k(mod3) and YS) x = S$ + 2k (mod3). 
xeEAt+l xE€A+2 


However, since 3 { k, we conclude that the three numbers S, S + k and S + 2k are 
pairwise incongruent modulo 3. But since every integer is congruent to precisely 
one of the integers 0, | or 2, modulo 3, we conclude that exactly one of S, S +k 
and S$ + 2k is divisible by 3. This is the same as saying that exactly one of the sets 
A, A+ 1 and A + 2 has sum of elements equal to a multiple of 3. Hence, there are 
as many k-element subsets of /3, whose sum of elements is divisible by 3 as there 
are equivalence classes of F with respect to ~. But since each such equivalence 
class contains three elements, Proposition 2.18 assures that the number of distinct 
equivalence classes is precisely L¢n). Oo 
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Problems: Sect. 2.3 


1. In each of the items of Example 2.14, prove that the given relations are, indeed, 
of equivalence. Also, identify the corresponding equivalence classes. 

2. * Given k,m €N, define a relation ~ in the family of k-element subsets of [jy 
by letting A ~ B if and only if there exists 0 < j < 2 for which B= A+ j 
modulo 3. Prove that ~ is an equivalence relation. 

3. Show that the number of distinct ways of partitioning a set of ab elements into a 
sets of b elements each is equal to aah 

4. Givenn,k €N, with 1 < k <n, let S(n,k) denote the corresponding Stirling 
number of second kind (cf. Problem 8, page 20) and let A be a finite set with n 
elements. Do the following items: 


(a) Show that every equivalence relation on A, with exactly k equivalence 
classes, can be obtained from a surjective function f : A > B, with|B| =k, 
as prescribed by item (b) of Example 2.14. 

Show that the number of equivalence relations on A having exactly k 
equivalence classes is equal to S(n, k). 

Conclude, from the previous items, that 


(b 


wm 


(c 


wm 


ie (k 
Sa = a Yeni ‘Ja aoe 
7 = J 


For the next problem, recall that a natural number p > | is said to be prime if 
1 and p are the only positive divisors of p. We also recall (see Sects. 6.2 and 6.3 
for details) that if a, b € N are such that p | (ab) but p { a, then p | b. 

5. We are given a prime number p and m pairwise distinct colors. Prove that there 
are exactly m’=" + m distinct necklaces, each of which formed by p beads, each 
bead being of one of the m given colors (there may well be beads of repeated 
colors). Then, use this counting to establish Fermat’s little theorem®: if p is 
prime and m is a natural number which is not divisible by p, then m?-! = 
1 (mod p). 

6. (IMO shortlist) Given n € N, we say that a sequence (x1, x2,...,Xn), with x; € 
{0, 1} for 1 < j <n, is aperiodic if there does not exist a positive divisor d 
of n such that the sequence is formed by the juxtaposition of 5 copies of the 
subsequence (x1,...,Xq). If a, denotes the number of aperiodic sequences of 
size n, prove that n | dy. 

7. Prove the following theorem of Sperner: if F is a family of subsets of [,,, pairwise 
uncomparable with respect to inclusion, then 


© After Pierre S. de Fermat, French mathematician of the seventeenth century. For a different proof 
of this result, see Sect. 10.2. 
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Fs ( " ), 
[n/2| 


with equality if F = {A Cc [,; |A| = Ln/2]}. 

8. This problem generalizes Problem 19, page 31. To this end, letn € N, p be prime 
and f : I, — In be such that f (P) = Td, with f (P) standing for the composition 
of f with itself, p times. 


(a) Show that f is a bijection and, if ~ is the relation defined in J, by 
a~beb= f(a), Aj €Z, 


then ~ is an equivalence relation. 

(b) Each equivalence class of [,, with respect to ~ has either | or p elements. In 
particular, if f has exactly k fixed points, then p | (7 — k). 

(c) Conversely, given 0 < k <n such that n —k = pl, with! € Z,, show that 
for each choice of elements x1, ..., x% € J, and for each partition 


In \ {x1,---,%e} = ALU...U Al 


of I, \ {x1,..., Xx} into pairwise disjoint p-element sets, there are exactly 
(( p- 1)!)' functions f as in the statement, having precisely x1,..., xz as 
their fixed points and satisfying f(A;) = A; for 1 <i </. 

(d) Conclude that the number of functions f : I, — I, such that f ) = Id is 


given by 
n n—k nok n! 
rt Jovy? = ) ee = 
O<k<n k Pricer! O<k<n k}. p? 
p\|(n—k) p\(n—k) 
(n—k)/p 


For the coming problem, recall that a family F of subsets of J, is an 
intersecting system of I, if, for all A, B € F, wehave AN BAG. 
9. The purpose of this problem is to prove the famous Erdés-Ko-Rado theorem 
for positive integers k and n, with 1 < k < n, if F is an intersecting system 
formed by k-element subsets of J,,, then 


ve 


(j), if k > n/2 


els (71), if k<n/2° 


To this end, do the following items: 


7 After the Hungarian, Chinese and German twentieth century mathematicians Paul Erdés, Chao 
Ko and Richard Rado, respectively. 
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(a) Ifk > 3, show that |F| < (7). 


(b) If k < 5, give an example of an intersecting system F of I,, formed by 


k-element subsets of /,, and such that |F| = ‘a 


Also in the case k < os fix an arbitrary intersecting system ¥ of [,, formed 
by k-element subsets of J,,. Let S,, denote the set of permutations o : I, > 
I,, and C, denote the set of circular permutations of J,, so that |S,| = n!, 
ICyn| = (n — 1)! and, foro € S,, we have @ € Cy, with |o| = n (here, 
and as in the text, o stands for the equivalence class of o with respect to the 
equivalence relation defining circular permutations). Define A € F anda € 
Cy as being compatible provided A = {a,,..., ax}, for some permutation 


(a1, ..-, Ak, Ak+1,---,4n) EO. 


(c 


wm 


i. Foro € S,, show that at most k sets in F are compatible with o. Then, 
conclude that there are at most (n—1)!k pairs (A, 7) with A € F,o € Sy 
and A and o compatible to each other. 

ii. For a fixed A e€ F, show that there are exactly k!(n — k)! elements 
o@ € C,, such that A and o are compatible. From this, conclude that there 
are exactly k!(n — k)!|F| pairs (A,@) such that A € F,o € S, and A 
and @ are compatible. 

ili. Complete the proof of Erdés-Ko-Rado theorem. 


2.4 Counting with Metrics 


In this section, we present a counting technique with a distinguished geometric 
flavor. To this end, we shall need the following important definition, which is 
actually a concept pervasive to all of Mathematics. 


Definition 2.23 If X is anonempty set, a metric in X is afunctiond : X x X > R 
satisfying, for all x, y, z € X, the following conditions: 


(a) d(x, y) =d(y, x). 
(b) d(x, y) > Oandd(x,y)=0Sx=y. 
(c) d(x,z) < d(x, y) + d(y, 2). 


The prototype of metric is the distance function in Euclidean space. More 
precisely, letting E* denote the three dimensional Euclidean space and d : E> x 
E> — R be given by 


d(P, 0) = PO, V P,O€ E’, 


then conditions (a), (b) and (c) above are clearly satisfied, condition (c) following 
from the triangle inequality. 

Thanks to the geometric example of the previous paragraph, from now on we 
shall refer ourselves to condition (c) of Definition 2.23 as the triangle inequality. 
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Another piece of terminology borrowed from that example is isolated in the next 
definition. 


Definition 2.24 Let d be a metric in the nonempty set X. Given x € X andr > 0, 
the ball® of center x and radius r is the subset B(x; r) of X, given by 


B(x; r) ={y € X; diy, x) <7}. 


Back to the former example of metric, given P € E* andr > 0 we have 


B(P;r) ={Q@¢€ E?; PQ <r) 


solid sphere of center P and radius r. 


Thus, at least at a heuristic level, this example justifies the geometric intuition 
attached to the name ball. 

In what concerns Combinatorics, our primary interest relies in metrics defined 
on finite nonempty sets. To understand why this is so, let us assume that X is such 
a set and d is a metric on it. Further, suppose that x1,..., xz € X are such that the 
union of the balls centered at those points and with some positive radius r cover X, 
1e., that 


Bix; r)U...U Blxg; r) = X. (2.17) 


If we have an upper bound on the number of elements of any ball of radius r in 
X, say 


IBa;r)| scr), VxE X, 


with c(r) > 0 depending only on r (and not on the chosen x € X), then (2.17) 
gives us 


|X| = |Buy; r)U...U B(xg; r)| 
<|Bousr)|+---+|BOx; 1)| 
<k-c(r). 


Therefore, 


1 
k> ae (2.18) 


8The reader with previous acquaintance with the theory of Metric Spaces will notice that what we 
call a ball is generally known in the literature as a closed ball. Nevertheless, in order to ease the 
writing we will stick to this slightly different terminology. We believe that, as far as these notes are 
concerned, this will be a harmless practice. 
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Thus, an upper bound on the number of elements of any ball of radius r 
automatically gives a lower bound on the number of balls of radius r needed to 
cover X. 

With such a general reasoning at our disposal, let us take a look at a first relevant 
example. 


Example 2.25 Givenn €N, let S, be the set of sequences of n terms, all of which 
equal to 0 or 1, i.e., 


Syn = {a = (41, 42,...,4); G4 =Oorl, VI <i <n}. 


Let d : S, x Sy — R be the function given, fora = (aj,...,d,) and b = 
(bi, ..-, bn) in Sp, by 


d(a,b) =#{1 <i <n; a; Dj}. 


Then, d is a metric on S,,, called the Hamming metric’ on S,,. 


Proof We clearly have d(a, b) > 0 and d(a, b) = 0 if and only if a = D. It is also 
clear that d(a, b) = d(b, a). In order to verify the triangle inequality, note that if 
a,b € Sy, then 


n 
d(a,b) =) > ai — bil. 
i=1 
Therefore, for a, b, c € Sy, the triangle inequality for real numbers gives 


n n 
d(a,c) =)" |a — cil < ) (ai — bil + [bi — cil) 


i=l i=l 


n n 
= Vila — bil + D5 1b - cil 


i=1 i=1 


=d(a,b)+d(b,c). 


In the notations of the previous example, given a € S, andk € N, we have 


B(a;k) ={b € Sy; d(a, b) < k} 


= {b € S,; a; # b; for at most k indices 1 <i <n}. 


° After the American mathematician of the twentieth century Richard Hamming, who used such 
a concept to study coding problems in Computer Science and Telecommunications Engineering. 
The famous Hamming codes were also named after him. 
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However, since there are exactly (ie) ways of choosing j indices 1 < i <n in which 
the sequences a and b are to differ, we conclude that 


n n nN 
|B(a; oi=(6)+({)+--+(f). (2.19) 


The above being said, the discussion that precedes the former example will give 
the coming one. 


Example 2.26 Letn € N and S, be the set of sequences of n terms, all of which 
equal to 0 or 1. Show that if we choose less than 


on 
() GO) FG) 
sequences in S,,, then there will always be at least one sequence in S,, which differs 
from each of the chosen ones in at least k + | distinct positions. 


Proof Choose | distinct sequences in S,, say x1,..., x7, with 


Then, (2.19) and computations analogous to those which led to (2.18) furnish 


|B(x1;k)U...U Boys k)| < |B; A) +--+ + [Bar 4) 


(Cee) 


Now, the fundamental principle of counting gives |S,| = 2”. Therefore, we 
conclude from the last inequality above that the union of balls B(x,;k) U...U 
B(x1; k) does not cover S,. Hence, there exists at least one sequence x € S, such 
that x ¢ B(x;; k), for 1 <i </. But this is the same as saying that 


d(x,xj)>k, Vi<i<l, 


and the rest follows from the definition of the Hamming metric. Oo 


Prior to presenting another relevant example, recall that, given sets X and Y, one 
defines their symmetric difference as the set X AY, given by 


XAY = (XUY)\(XNY). 


In words, X AY is the set formed by the elements which belong to exactly one of the 
sets X and Y. 
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It follows promptly from the definition that XA = X and XAY = YAX; 
moreover, the reader can easily verify that 


XAY =(X\Y)U(Y\X), 


for all sets X and Y. 
The following lemma gathers together other properties of the symmetric differ- 
ence which will be of our interest. 


Lemma 2.27 If X, Y and Z are any sets, then: 


(a) XAY=GOX=Y. 
(b) (XAY)AZ = XA(YAZ). 
(c) XAY=Z&X=YAZ. 


Proof 


(a) Since we clearly have X AX = Q, it suffices to show that KAY =@> X=Y. 
To this end, observe that X AY = @ > X UY = X MY and, hence, 


XCXUY=XNYCY; 
similarly, Y c X. 
(b) It is not difficult to verify that 
(XAYJAZ = (XUYUZ)\ (XNY)U(XNZ)UYNZ)), 


and analogously to X A(Y AZ). We leave as an exercise to the reader the task 
of checking the details. 


(c) Let us show one implication, the other being completely analogous. If X AY = 
Z, it follows from item (b) that 


X =GAX = (YAY)AX = YA(YAX) = YAZ. 


oO 


With the above results at our disposal, we can finally present another important 
example of metric in a family of finite sets, known as the metric of symmetric 
difference. 


Example 2.28 If F is a nonempty family of finite sets, then the function d : F x 
F — R, given by d(X, Y) = |XAY|, is a metric in F. 


Proof Since X AY = Y AX, we have 


d(X, Y¥) =|XAY|=|YAX| =d(Y, X) 
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for all X, Y € F. It is also clear that d(X, Y) > 0 and d(X, Y) = 0 if and only 
if XAY = %; however, by item (a) of the previous lemma, this last equality takes 
place if and only if X = Y. 

We are left to showing that d(X, Y) < d(X, Z) + d(Z, Y) for all X, Y, Z € Ff, 
or, which is the same, that |X AY| < |XAZ|+ |ZAY]. To this end, since |A U B| < 
|A| + |B| for finite sets A and B, it is enough that we establish the inclusion 


XAY C (XAZ) U (ZAY). 
In turn, such an inclusion follows at once from the equality 
(XAZ) U(ZAY) = (X UYU Z)\ (XNYNZ), 


whose proof we leave as an exercise to the reader. Oo 


Fix m € N and consider the metric of symmetric difference in F = P(m), the 
family of subsets of I. If X C [Im is given, let us compute the number of subsets of 
Tn in the ball B(X; k), where k > 0 is a given integer. To this end, start by noticing 
that 


Y € BUX; k) S& |XAY| <k; 


therefore, in order to count the number of elements in B(X; k), it suffices to count 
how many are the subsets Y C J, for which |X AY| < k. Fora fixed Z C I, item 
(c) of Lemma 2.27 guarantees the existence of a single Y C Jj, such that Z = XAY. 
Hence, 


{Y e P(m); |XAY| <k} ={Z € P(m); |Z| < k}. 


Finally, since for each 0 < j < k there are exactly () ways of choosing Z C In 
such that |Z| = j, it follows from the equality of families above that 


|B(X; k)| =#{Z € P(m); |Z| < k} 


_ (7) + (") es ("") (2.20) 


For the coming example, let Px (n) be the family of k-element subsets of J, and 
d be the metric of symmetric difference in P;(n). For distinct X,Y € Px(n), we 
have 


d(X,Y) =|XAY| =|XUY|—|XNY| 
= |X|+|¥|—2|xnY| (2.21) 
= 2k -|XNY)), 


an even number. 


2.4 Counting with Metrics 65 


Example 2.29 (BMO—Adapted) Letn > 2 be a given integer and F be a maximal 
family of 3-element subsets of J, satisfying the following condition: for any A, B € 
F, with A ~ B, we have |AN B| < 1. Prove that 


is ea 


Proof Let d be the metric of symmetric difference in P3(7). It follows from (2.21) 
that, for X, Y € Ff, 


IXNY|<1ad(X,Y)>4. 
We now claim that if F = {A,,..., Ag}, then 


P3(n) = B(Ay; 2) U...U B(Ag: 2). (2.22) 


Indeed, if this was not the case, then there would exist A € P3(n) \ (B(Aq; 2) U 
...U B(Ag; 2)), Le., such that d(A; Aj) => 4 for 1 <i < k. By what we did above, 
this would be the same as having |AM A;| < | for 1 <i < k. In turn, this would 
assure that F’ = {A, Aj, ..., Ag} would be a family strictly containing F and also 
satisfying the stated conditions. This contradicts the maximality of F. 

Thus, it follows from (2.22) that 


n 
( ) = |P3(n)| = |B(A1; 2) U...U B(Ag; 2)| 
3 (2.23) 
< |B(A1; 2)| +--+ + |B(Ag; 2)I- 

Now, given A = {a,b,c} € P3(n), let us compute the number of sets in 
B(A;2) C P3(n): recall that, for A’ € P3(n), the distance d(A’, A) is even; 
therefore, (2.21) gives 

d(A’,A) <2 #d(A’, A) =0or2 
~ A'=Aor |A/NA| =2 
>? A’ = {a, b, ch, {a, b, x}, {a, Cc, x} or {b, Cc, x}, 
for some x € J, \ {a, b, c}. We then conclude that 
|B(A; 2)| = 1+ 3(n — 3) =3n-8. 
We finish by using (2.23) and the last computation above to obtain 
a 1 n _ nn DM —2) 
~ 3n — 8\3 6(3n — 8) 
n(n — 1)(n — 2) 
6(3n — 6) 
n(n — 1) 
18 
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Problems: Sect. 2.4 


1. Prove that the Hamming metric is a particular case of the metric of symmetric 
difference. 

2. (Ireland) Let n > 11 be an odd integer and T be the set of n-tuples (x1, ..., Xn) 
such that xj € {0,1} for 1 < i < n. Givenx = (4,...,%) and y = 
(y1,---, Yn) in T, let 


f@,y) =#{1 <i <n; xj F yi}. 


Suppose that there exists S C T satisfying the following conditions: 


(a) [S| =2"¥. 
(b) For each x € 7, there exists a single y € S for which f(x, y) <3. 


Prove that n = 23. 
3. * Given an integer n > 3, let f(n) be the largest possible number of 4-element 
subsets of J,, such that any two of them have at most two elements in common. 


Prove that 
1 (n F«( ee n 
— < = : 
16\3 eg Ng 


4. * Letn > 2 be an integer and Aj,..., Ag C I, be such that, for 1 <i < j <k, 
we have |A; AA;| > 2r + 1, where r is a given positive integer. Prove that 


Kx 


(+) t-°+0)) 


5. (india) Out of a committee of m members, 1997 distinct subcommittees are 
composed. It is known that, for any two of these subcommittees, there are at 
least five people participating of exactly one of them. Prove that n > 19. 

6. (Brazil) During a summer camp, Arnold and Beatrice talked to each other using 
smoke clouds, sometimes big ones, sometimes small ones. Every day before 
breakfast, Arnold sent to Beatrice a sequence of 24 clouds. As Beatrice not 
always manages to distinguish a small cloud from a big one, they composed a 
dictionary before entering the camp. This dictionary collected n sequences of 24 
sizes of clouds (for instance, one such sequence could be SBSBSBSBSBSBBS- 
BSBSBSBSBS, where B stands for big and S stands for small). For each of the 
n sequences, the dictionary indicated its meaning. Moreover, in order to prevent 
misinterpretation, Arnold and Beatrice avoided including similar sequences in 
the dictionary; more precisely, each two sequences in the dictionary always 
differed in at least 8 of their 24 positions. Show that n < 4096. 


Chapter 3 M®) 
Generating Functions cen 


In this chapter we briefly discuss the powerful method of generating functions, 1.e., 
the use of power series in the analysis of counting problems that are inaccessible by 
the more elementary techniques presented so far. 

Unlike other approaches to this subject (e.g., that of the classic [41]), the material 
collected here makes heavy use of the methods of Differential and Integral Calculus, 
in the level of [3] or [8], for instance. Although working with convergent power 
series apparently requires far more background than using formal power series, it 
is actually much more complicated to rigorously justify the correctedness of some 
algebraic manipulations with this last type of series. Furthermore, we believe that, in 
a first acquaintance with generating functions, the reader with some Calculus back- 
ground feels much more comfortable in working in the realm of convergent power 
series. Anyhow, for the reader’s convenience Sect. 3.2 briefly reviews the basic facts 
on the convergence of power series needed for our subsequent discussions. 

For a comprehensive exposition on the uses of generating functions in Combi- 
natorics, we refer the interested reader to the above mentioned wonderful book of 
professor H. Wilf. 


3.1 Introduction 


By definition, the (ordinary) generating function of a sequence (a;,)y>0 of real 
numbers! is the power series 


) Anx”. 


n>0 


!We shall sometimes refer to the generating function of a sequence (ap ),>1 of real numbers, but 
this should be no source of confusion. 
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Among all generating functions, one of the simplest ones is, in a certain sense, 
that of the (sequence formed by the) nonnegative integers: )°,., x”. As the reader 
probably knows from earlier studies (cf. Section 7.4 of [8], for instance), such a 
power series converges to fa when |x| < 1. Thus, for |x| < 1, we have 


1 


= = ie (3.1) 


n>0 


and say that the series at the right hand side is a geometric series. 

In order to illustrate the kind of use we have in mind for generating functions, we 
consider in this section two preliminary examples. Their role is to give the reader an 
easy glimpse on how the method works. 

The first example revisits Problem 7, page 49, and employs some elementary 
facts on continuous functions, which can be reviewed in Chapter 8 of [8], for 
instance. 


Example 3.1 (TT) The set of natural numbers is partitioned into m infinite and 


nonconstant arithmetic progressions, of common differences d1, dz, ..., dm. Prove 
that 

: + : celia a 1 

di dz din 7 
Proof Let f(x) = ee x” be the generating function of the natural numbers. 


Letting a; be the initial term of the arithmetic progression of common difference dj, 
the stated condition assures that, for |x| < 1, we have 


m 
FR) = lat $ xh 4 gerd go), 


i=l 


Hence, for |x| < 1, it follows from (3.1) (with x% in place of x, in each series in the 
sum at the right hand side above) that 


Multiplying both sides of this last equality by 1 — x and doing some algebra at the 
right hand side, we get 
x 


m 
= 2) (age ee 


(3.2) 
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Now, note that both sides of the equality above define continuous functions in the 
closed interval [0, 1], and our reasoning so far assures that such functions coincide 
in the half-open interval [0, 1). Therefore (and it is precisely here where we use the 
basic theory of continuous functions), they coincide also for x = 1, and evaluation 
of (3.2) at x = 1 gives 


| 


Our next example uses generating functions to obtain a positional formula for the 
n-th Fibonacci number. 


Example 3.2. Let (Fn)n>1 be the Fibonacci sequence and f(x) = De a F,, x" be its 
generating function. For the time being, assume that the series defining f converges 
on an open interval of the form (—r, 7), for some real number r > 0 (we shall see, 
in the paragraph following Proposition 3.7, that this is indeed the case). Then, we 
can write 


f(x) = Fixt+ Fox? + > Fx" 


n>3 


x +x? +9 Fai + Fn—2)x" 


n>3 
=x+x*4+x » Fox! x2 > F,_9x"~? 
n>3 n>3 


=x4+x%74-x(f (x) — Fix) +x? f(x) 


x+(x+x) f(x), 


thus getting 


f(x) _ 
x)= 
; l1—x— x2 
for x € (-r,r). 
Now, writing l-x-x?= (1—ax)(1— Bx), witha, B € R, we havea+ 6 = 1, 
af = —1 and 
x x 


f@) = l—x—x? (l—ax)(1— Bx)’ 
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By imposing, without loss of generality, thata > B, we geta = Lee and B = 
Lee so thata — 6 = /5. Thus, 


_ = 1 I 
FO) = FF l—-ax 1-—fx/)° 


By applying (3.1) (with ax and 6x in place of x) to expand the right hand side 
in power series, we get 


f@= - Yi (ax)" — So(Bx)” 


n>0 n>0 
er eae 
n>1 V5 


It is important to point out that the discussion at the beginning of this section 


guarantees that the geometric series involved in the above computations converge 


for every x € R for which |ax|, |Bx| < 1, i.e., for |x| < ae Hence, letting s = 


min {r. | > 0, it follows that 


Fix" => (fF) (3.3) 


n>1 n>1 


for |x| < s. This way, if the power series representing f is unique, there will be no 
alternative than to infer, from the last equality above, that 


q? — p” 


i 3.4 
a (3.4) 


for every n > 1. 

Finally, the uniqueness result we hinted to above is indeed true, and will be 
explicitly stated in Corollary 3.6. This validates the last step above and shows 
that (3.4) does hold. 


Problems: Sect. 3.1 


1. In the notations of Example 3.1, if a1, a2, ..., Gm are the initial terms of the 
progressions, prove that 
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2. The Lucas sequence (Ly,)n>1 is defined by Ly = 1, Lz = 3 and Lyy2 = Legit 
Lx, for every integer n > 1. Prove that: 


(a) L, = a” + B" for every n € N, where a = ne and 6B = deve 

(b) Loy = L2 + 2(—1)""! for every n € N. 

(c) 1 _ et ey = 4(—1)” for every n € N, where (F,)n>1 denotes the Fibonacci 
sequence. 
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In order to justify and extend arguments like those of the two examples of the 
previous section to more general situations, we shall review in this section some 
facts on the convergence of general real power series. Since a thorough exposition 
of the theory of real power series belongs more properly to the realm of basic real 
analysis, we have chosen to be somewhat sketchy, referring the interested reader 
to Chapter 11 of [8] for a more detailed account of the theory. Later, in Chap. 21, 
extensions to complex power series will be discussed and applied. 

Let r > 0 be a real number. We say that a function f : (—r,r) — R admits the 
(power series) expansion 


fas y ae 


n>0 


in the interval (—r, r), or that the power series )\,,... dnx” converges to f in the 
interval (—r, r), if 


for every x € (—r,r). 
A relevant example is that of the geometric series (3.1). As a corollary we note 
that, for a given a 4 0, we have 


1 ot 
= ) ax (3.5) 
l—-—ax =O 


1 


when |x| < =e Indeed, if |x| < Tal? then |ax| < 1, so that it suffices to apply (3.1) 


to ax in place of x. 
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We shall frequently need to combine the power series expansions of two given 
functions to obtain that of another function, obtained from those by means of the 
arithmetic operations of addition, subtraction or multiplication of functions. In this 
respect, the very definition of convergence of power series assures that, if f(x) = 


enzo nx" and g(x) = 0,59 nx” along (—r,r), then 


(f £ g)(x) = Yon + bn) x" 


n>0 


in (—r, r). Concerning multiplication, we have the following result. 


Proposition 3.3 If the functions f, g : (—r,r) — R have power series expansions 
f(x) = Visco axx* and g(x) = 0 b)x', then the function fg : (—r,r) > R 
has power series expansion (f g)(x) = et Cnx", with 


n 
Ch = = agby = S > agbn—k- (3.6) 
k=0 


k+l=n 


Thus, for |x| < 1 and letting a, = 1 for every k > 0, we get from (3.1) that 


1 1 1 ; : 
=o! (9 pee Se agx Yo ax 


k>0 1>0 


- (> an] y (> ) x 


n>0 \k=0 n>=0 \k=0 
— Yia + 1)x”. 
n>0 


Moreover, such a reasoning can be easily generalized to give the power series 
expansion for the function x > ae for |x| < 1 (cf. Problems 1 and 2, page 78). 
For the purposes we have in mind, the main result on power series is the one 


collected in the coming theorem. 


Theorem 3.4 Let r > 0 be such that the power series )\,.. 4nx" converges at 
every x € (—r,r). If f : (—r,r) > Ris given by f(x) = >, 59 nx", then f is 
differentiable, with f'(x) = > ,54 naynx"—! for every x € (—r,r). 

In words, the previous theorem guarantees that we can termwise differentiate a 


convergent power series, obtaining, as result, the power series corresponding to the 
derivative of the function defined by the original series. 
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Example 3.5 This example shows how to use the previous result to reobtain, for 


|x| < 1, the power series expansion of the function x b> 5. Since — = 


L 
(1—x) 
> +o x” for |x| < 1, differentiating both sides of such an equality we get 


1 


aon = Va = Yin + 1)x”, 


n>1 n>0 


and this is also valid for x € (—1, 1). 


A very important corollary of Theorem 3.4 is the uniqueness of the power series 
representing a certain function. 


Corollary 3.6 If f : (—r,r) — R admits a power series representation in 
the interval (—r,r), then f is infinitely differentiable. Moreover, its power series 
representation is unique and is given by 


f@)= \ az 


n>0 


with dy = af” (0), where f (0) denotes the value of the n-th derivative of f at 
0. 


Proof Firstly, note that f(0) = ao. Now, the previous theorem gives us f’(x) = 
Ss nayx"—! in (—r, r); in particular, this gives f’(0) = a,. Again by the former 
result, f’ is differentiable and we get f”(x) = D°,.,n(n — L)a,x"~*, so that 
f'(O) = 2ap. Proceeding inductively this way, we easily finish the proof. oO 


The actual importance of Theorem 3.4 will completely reveal itself only if we 
have at our disposal some criterion which allows one to establish the convergence 
of a given power series )°,..) dx" in some open interval centered at 0. For our 
purposes, the coming result, which is a particular case of the comparison test for 
series, will suffice. 


Proposition 3.7 Let (an)n>0 be a sequence of real numbers. If there exist positive 
reals c and M such that |a,| < cM” for every n > 0, then the power series 


aes. anx" converges on the interval (<a: in): 
Proof Since 
|anx"| = |ay||x|" < cM" |x|" = c|Mx|" 
and (cf. (3.5)) the geometric series }°,,.) |Mx|" converges when |x| < ua the 


comparison test for series assures that )>,,..9 dnx” converges when |x| < i: Oo 


With the above proposition at hand, we are in position to complete the analysis 
of Example 3.2. Indeed, letting (F;,),>1 denote the Fibonacci sequence, an easy 
induction shows that F, < 2” for every n > 1. Hence, by the previous result, the 
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corresponding generating function, Dae 1 fnx", converges whenever x € (-3. 5 


In turn, this assures that all computations made at that example are actually valid 


whenever |x| < min [3 al = 7 In particular, we can use Corollary 3.6 to 


conclude, from (3.3), that (3.4) does hold. 

As Problem 5 will show, generating functions can be used to solve more general 
second order linear recurrence relations. Actually, when we have complex 
numbers at our disposal, we will completely analyse a much more general version 
of this result, in Chap. 21. 


Example 3.8 As a relevant application of the previous proposition in conjunction 
with Theorem 3.4, let us show that 


1 
f=) —e" (3.7) 


n>0 


for every x € R. 

We first claim that, given A > 0, there exist a > O and no € N such that 
n! > aA” for every n > no. Indeed, fixed a natural number no > A we have, for 
n > no also natural, 


n!=no! (no + 1)...(n—1)n > no!A”” = an’ 


n—no 


It thus suffices to take a = 40°. 


Now, given A > 0, take a > 0 and no € N as above. Letting a, = ! c=tand 


n!? a 
M= i, we conclude that 0 < a, < cM” for every natural number n > no, and the 


previous proposition assures that the power series 


no 


Yar = e+ af 
n=0 


defines a function f : (—A, A) > R (note that + = A). 

However, since A > O was arbitrarily chosen, we conclude that f(x) = 
Yat ax" is actually defined in the whole real line R. Moreover, Theorem 3.4 
guarantees that f is differentiable and such that 


1 1 
fia) = Din aat tl =) x = fe) 


n>1 n>0 


for every x € R. In turn, this gives Z(e* f (x) = 0, so that x  e* f(x) is 
constant in R. Evaluating at x = 0 then yields f(x) = e* for every x ER. 
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For future use, we notice that changing x by ax in the equality of the previous 
example, we get 


a yee (3.8) 


for every x € R. 

A variation of the argument used in Example 3.8 allows us to expand f(x) = 
(1 + x)® in power series when |x| < 1 (and for a fixed real number a ¥ 0). 

To this end, given a € R andn € Z \ {0}, we begin by defining the generalized 
binomial number (“) by letting (5) = 1 and, forn > 1, 


(\-“—oeeee. 3.9) 


n n! 


Lemma 3.9 Givena € Randn €N, we have: 


a=". + Cs) 
(b) GC) (*_ 5), for every a £0. 
(c) |(0)| < 1 whenever |a| < 1. 


Proof 


(a) Is an easy computation: 


— = —a(a — 1)(a—2)...(a—n+1) 
n n n! 


~-@-1@-2)...@=n) 
n! 


1 
= F(a — Ia —2)...(@—n + Ia —(@—n)) 
1 


_ fa- 1 
Nee 1 
(b) Follows immediately from (3.9). 


(c) If |a| < 1, it follows from (3.9) and the triangle inequality that 


II 
a 


(*\/< la|(Je| + 1)(la| +2)... (a) +n —1) 2 ix casen 


n n! n! 
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The coming result is usually referred to as the binomial series expansion or 
Newton’s binomial theorem, and is due to Sir Isaac Newton.” 


Theorem 3.10 (Newton) Fora 4 0 and |x| < 1, we have 


d+n*=)> Ge (3.10) 


n>0 


Proof Let us firstly consider the case 0 < |a| < 1. Since ()x"| < |x|" (by item 
(c) of the previous lemma) and )*,,..¢ |x|” converges when |x| < 1, Proposition 3.7 
assures that ee, Can converges when |x| < 1. Hence, thanks to Theorem 3.4 
the function f : (—1, 1) > R, given by f(x) = Yoyso (%)x", is differentiable, with 


f= >on aye = yao fe (3.11) 
n>1 n>1 


(note that we have used item (c) of the previous lemma in the last equality above). 
The above expression for f’(x), together with item (a) of the referred lemma, gives 


(+xf'@) =a(lt+x) >> ( a 


1 
n>1 


m (Aa parte (oe 


n>1 n>1 


=a 


Therefore, letting g(x) = (1+ x)~* f(x), we get for |x| < 1 that 


g(x) = —a(L +x)! f(x) + 1 tx) f(x) 
= (1+ x)? {af (x) + (1 +2) f'(x)} = 0. 


? As quoted in [8], Newton is considered to be one of the greatest scientists ever, being difficult to 
properly address the scope of his contributions to the development of science. 
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In turn, this means that g is constant in the interval (—1, 1). However, since g(0) = 
1, we get (1 + x)~% f(x) = 1 for |x| < 1, as wished. 

For the general case, assume that (1 + x)* = Lae (° yx" for some a 4 0 and 
every x € (—1, 1). We shall show that analogous formulas hold for @ — 1 anda + 1 
(and every |x| < 1): 


(a) For a — 1: Theorem 3.4 gives, for |x| < 1, 


1 d 1 a 
1 ORE 1 a n-1 
oa a ie are) a Y2(*)s 


n>1 


(b) For a + 1: we have 


a a n a n a n 
(+x) =a+0D(*)s 7 ’ RU) . 


n>0 n> n=0 
SS a a a+l1 
= 1 = y x” 
n>1 n>1 
3 1 
n 
n>0 


A straightforward inductive argument now shows that (3.10) holds for every 
a # 0 and every |x| < 1. 
oO 


Corollary 3.11 For a, B 4 0, we have 


(1+ 6x)" => (*) (Bx)”, 


n>0 


for every x € R satisfying |x| < Al 


Proof It suffices to apply (3.10) for Bx in place of x, observing that |Bx| < 1 <> 
|x| < Te Oo 


As an example of application of the former corollary, for |x| < 5 we have 


(l—2x)"¥? = o(- Ve 2x)", 


n>0 
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with 


_ (-1)" 3 5 2wn-1 
~ nl a. 3 2 
ee (2n)! 
~ nl 2%.2-4---(2n) 
(=1)" (20)! 
~ Al 20. 2p! 
— (=1)" (20 
“qn nj 
Hence, for |x| < 5 we get 
_ 1 /2n 
(1 — 2x) or Oe (3.12) 


n>0 


an equality that will be quite useful in the analysis of Example 3.14. 


Problems: Sect. 3.2 


1. * Generalize the result of Proposition 3.3, showing that if f : (—r,r) > R has 
power series expansion f(x) = ee, ayx", then, for k € N, we have f(x)* = 
e, cnx", where 


= ) Aj Air... Giz 


and the sum extends over all k-tuples (ij,...,ix) of nonnegative integers 
satisfying 4) +--- +i, =n. 
2. * Given k € N and |x| < 1, prove that 


1 k+n= 1), 
aaa EN) 
n>0 


3. * The purpose of this problem is to show that the function f : (—1,1) — R, 
given by f(x) = log(1 + x), admits a power series expansion, as well as to get 
such an expansion. To this end, do the following items: 
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(a) If f(x) = ast anx", use Corollary 3.6 to conclude that ag = 0 and a, = 


alae forn > 1. 
(b) Use Proposition 3.7, in conjunction with Theorem 3.4, to conclude that the 


(-1)""! 
n>1 n 


power series >> 
R. 

(c) Show that g’(x) = a for every x € (—1, 1); then, conclude that f(x) = 
g(x) for every x € (—1, 1). 


x” defines a differentiable function g : (—1, 1) > 


4. * Let (@n)n>0 be a sequence of real numbers such that |a,| < cM”, for some 
c, M > Oandevery n > 0. 


(a) Use Proposition 3.7 to show that, for every x € (- an i): the power series 


an_ynt+l 
Din>0 aaT*” converges, 


(b) If f, F: (<a i) — Rare given by f(x) = )¢,s9 anx” and F(x) = 
Yeo A4x"*1, Theorem 3.4 assures that F’ = f. Use this fact, together 


n> 
with the Fundamental Theorem of Calculus, to show that 


i.e., that we can compute the integral of a function defined by a power series 
by termwise integrating the corresponding series. 


5. Given u,v € R, with v F 0, let (d_)n>1 be such that ay42 = ua + va, for 


every k > 1. Moreover, assume that the equation x* — ux — v = 0 has real roots? 
a and Bf. 
(a) Ifa 4 B, show that a, = Aa”! + BB"! for every n > 1, where A and B 
are the solutions of the linear system meet ‘ 
aA+PBB=a 
(b) If w = B, show that a, = (A+ Bn)a"—! forn > 1, where A and B are the 
solutions of the linear system ula ; 
(A+2B)a = a2 


6. * For x € R \ Z, we define the integer closest to x, denoted [x], as the single 
integer n such that |x — n| < 5. In symbols, 


—= S _— =. 
n = [x] |x lees 


Ifn > 1 is an integer and d, denotes the number of derangements of J, prove 
that d, = [#]. 


3Actually, as will be seen in Chap.21—cf. also Problem 6, page 393—such a restriction is 
unnecessary. 
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In this section, we look at some recursive problems through the eyes of generating 
functions. Such examples were chosen with a twofold purpose: on the one hand, to 
illustrate the diversity of situations to which the method developed in the previous 
two sections can be applied; on the other, to work out some techniques which are 
themselves applicable to other relevant settings. 


Example 3.12 Compute, with the aid of generating functions, the number of 
nonnegative integer solutions to the equation 


aj+agt+:::+a,r=m, 


where k and m are given natural numbers. 


Solution Note that a; + a2 +---+ az = m if and only if x%x®...x% = x”. 
Therefore, there are as many nonnegative integer solutions (a1, a2,..., ax) of the 
given equation as there are ways of getting a summand x” in the product 


feat te¢447 toe) 4e5e $e wll eer ens 
ere———$—<—<—— ——— SS = ———— 
k 


(yet in another way, taking x“! in the first factor, x“? in the second, ..., x“ in the 
k-th factor, we get x” in the product). However, for |x| < 1 we have 


ee I k+n-1)\, 
O=(7 =) “aa n Je 


n>0 


where, in the last equality, we applied the result of Problem 2 of the previous section. 
Hence, the answer to our problem is the coefficient of x” in the above series, i.e., 


oo) (in turn, note that this agrees with the result of Problem 9, page 29. oO 


The method of generating functions is particularly useful in the study of 
sequences (d,)n>0 satisfying recurrence relations more general than those treated 
in the previous section. This is the case, for instance, for the sequence (dy)n>0 
such that a9 = 1, aj = —l anda, = at + 2an—2 for n > 2, since the 
coefficients of the recurrence relation are not constant. In this case, the idea is to 
consider the corresponding generating function, say f(x) = )°,.94nx”, and to 
follow the sequence of steps delineated below: 7 


I. To use the initial values and the satisfied recurrence relation to conclude that 
the generating function converges on some interval of the form (—r, r), with 
r>0. 

II. Again with the aid of the initial values and the given recurrence relation, to 
perform appropriate operations with the series expansion of f in order to get a 
closed form (i.e., a formula) for f (x). 
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III. To expand in power series the formula for f(x) obtained in the previous step. 

IV. To use the uniqueness of power series representation, given by Corollary 3.6, to 
conclude that a, is equal to the coefficient of x” in the power series expansion 
of step HI. 


Note that the steps just described are precisely those which were followed in 
Example 3.2, the first having been concluded right after Proposition 3.7. We shall 
run through them once more, this time to solve the next example. 


Example 3.13 Let (dy)n>0 be the sequence given by ap = 1, aj = —1 and 
an = a + 2an—2, 
n 


for every n > 2. Compute a, as a function of n. 


Solution Let f(x) = )°,,.9 anx” be the generating function corresponding to the 
given sequence. 


Step I: we shall try to apply the comparison test for power series, as given by 
Proposition 3.7. More precisely, assuming that |a,)—2| < @"~* and |dn_1| < 
a”—!, it comes from the triangle inequality, together with the given recurrence 


relation, that 


n—-1 


an-1 = 
lan=1| Aide | 2 —— hg, 
nN 


n 


lan| < 


a n—| = 7 
Therefore, we shall have |a,| < a” provided a + 2o"-2 < q"”, or, which is 
the same, 


a 2 
—+2<a’. 
n 


Since this last inequality is true for a = 2 and every n > 2, and since |ao| < 2° 
and |a;| < 2! it follows by induction that |a,| < 2” for every n > 1. Hence, 
Proposition 3.7 guarantees that f is defined and is differentiable in the open 


: 11 
interval (-3. t). 


Step IT: — writing the given recurrence relation as nay = —dn—|+2nday_—2 forn > 2, 
we have 
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fi = ee me =a,t+ So nanx"| 


n>1 n>2 
-1 
=a,t+ So (-an-1 + 2nan—2)x" 
n>2 
= n—-1 n—2 
=a, — An—|XxX + 2x NAn—2X 
n>2 n>2 


= ay — (f(x) = ap) + 2x | Yn = 2)dn—ax* + 29 7 np? 


n>2 n>2 


=a, +a — f(x) + 2x(f'(x) + 2f()). 
However, since a; + ag = 0, we get f’(x) = (4x — 1) f (x) + 2xf'’(x) or, yet, 
C= Df O)==Ge= D7. 


In order to integrate (i.e., to find the solutions of) the above differential equation, 
note first that f is positive in some interval (—r,7r), for some 0 < r < 5 (this 
comes from the fact that f(0) = ag = 1 > O and f, being differentiable, is 
continuous, hence has the same sign as f (0) in a suitable neighborhood of 0). 
Thus, for |x| < r we can write 


fie) 4x-1_ 1 
FO) Oe 1. 2x —1 


and then, for |x| <r < 5 


x f'(t) dt 
ft) 


0 
- 1 
=— 2+ —  }dt 
[ @+aa) 


1 
= —2x — 2 log( = 2x). 


log f(x) =log fo), = 


Hence, for |x| <r < 5 we have 


fal=e “= 2), (3.13) 


Step III: firstly, recall that the power series expansion of e~** is given by letting 
a = —2 in (3.8), and is valid in the whole real line: 


x (—2)* 
>> 7 xk, 


k>0 
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Therefore, for f given as in (3.13) and |x| <r < 
Proposition 3.3 that 


5, it follows from (3.12) and 


(—2)* 1 (21 
fay =(D a aly 


k>0 . 1>0 

(=2)" 1 721 : 

= Fa) x 
n>0 \k+l=n 

“Ne aap 7 

7 P n (=1)7—# 21 , 

-Z(2 Sans) 
n>0 1=0 


Step IV: comparing the previous expansion with f(x) = ea anx", we get 


n -l 
—1)" 21 
poy = 7), 
4(n—D!\1 
1=0 
oO 
In order to apply the circle of ideas discussed so far to counting problems of 
recursive character, apart from the four steps previously listed one has to execute a 
further one, namely: given a combinatorial problem in which one wants to compute, 
as a function of n € N, the total number a, of possible configurations, one has 
to compute the initial values a, and to get a recurrence relation satisfied by the 
sequence (da,)n>1. Let us carefully examine a relevant example. 


Example 3.14 Compute, as a function of n > 3, the number of distinct ways 
of partitioning a convex n-gon into triangles, by drawing diagonals which do not 
intersect in the interior of the polygon. 


Solution Letting a, be the number of partitions we wish to compute, it is clear that 
a3 = 1. To obtain a recurrence relation valid for n > 4, label the vertices of the 
n-gon as Aj, A2,..., An, consecutively and in the counterclockwise sense. There 
are two possibilities: 


(i) No diagonal departs from A : hence, diagonal A2 A, must be drawn, so that we 
are left to partitioning the convex (n — 1)-gon A2A3...An, with no diagonals 
intersecting in its interior. The number of such partitions is, of course, d,—1. 

(ii) Diagonal A; Ax is drawn, for some 3 < k < n — 1, but no diagonal A, Aj, 
with j < k, is drawn: then, the partition of A; Az... A, induces partitions with 
the stated property in the convex polygons A; A2... Ax and Aj Az Agy1... An 
moreover, in the partition of A; Az... Ax, no diagonal departs from A. 
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Conversely, given any partition of Ay Az... A, with no diagonals intersecting in 
its interior but some diagonal departing from A1, there exists a unique 3 < k <n—1 
such that the partition under consideration comes from exactly one pair of partitions 
of Aj A2... Ax and Aj Ag Ay)... An as in the previous paragraph. 

Now, if k = 3, there is exactly 1 way of partitioning A;A2A3; if 3 < k < 
n — 1, then arguing as in item (i) we conclude that there are exactly ay_, ways 
of partitioning A; Az... A, with no diagonal departing from A;. A simple way of 
unifying these possibilities is to set a2 = 1, and we shall make this convention 
henceforth. 

Since there are dyj—x~42 ways of partitioning A,;A;,Az41...An as desired, 
success applications of the multiplicative and additive principles give exactly 
¥ 723 Ak—1dn—k+2 partitions of type (ii) for Ay Az... An. 

Finally, taking both cases above into account (and recalling that a2 = 2), we 
obtain the following recurrence relation for a,, with n > 4: 


n—1 


An = 4y-1 + > Ak—-14n—k+2 
k=3 


n 
= > Ak—14n—k+2 
k=3 


n—1 
= . Akan—k+1- 
k=2 


For what is to come, it is worth noting that the last expression above also makes 
sense for n = 3, and that if we set a; = 0 we can write it as 


n 
On =P) kdn—-kvt, Vn > 3. (3.14) 
k=1 


As far as we know, there is no simple way to establish the convergence, in some 
open interval centered at 0, of the generating function )°,. , dnx” corresponding 
to the sequence (a,),>1. Nevertheless, it is not difficult for the reader to convince 
himself/herself that the fulfillment of this step is unnecessary if we obtain, in the 
end, a positional formula for a, for which )*,,..; nx” does converge in some open 
interval centered at 0. We shall have more to say on this in a little while. 

Back to the analysis of (3.14), note that the right hand side pretty much resembles 
formula (3.6) for the coefficients of the product of two convergent power series. 
Therefore, setting f(x) = )>,5) nx", we are led to compute f (x): 


fay? = (Dax'| (Par!) = (0 aver) 
k>1 I>1 n>2 \k+l=n 
n—1 n—-1 


 (Saven-t) x= 0 (Tene) 2 


n>2 \k=1 n>4 \k=1 
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Now, (3.14) gives ayn_1 = we Akan—k, SO that the computations above give 


fo SG at) ex =a fea) 


n=4 n>3 


Putting g(x) = paee An42x", we get f(x) = x7g(x), and it follows from the 
equality above that 


x4 g(x)? — x(x*g(x) _ x) = x3 (g(x) — 1), 
ie., xg(x)* — g(x) + 1 = 0. Thus, 
lrwWl—4x _ 2 
2x tel ax 
and the fact that g(0) = a3 = | gives 
lav l=ae 2 
2x ~ 14/1 —4x° 


(Recall that we are assuming that the power series defining f—and, hence, g— 
converges in some open interval of the form (—r,r), for a certain r > 0; 
therefore, for the square roots above to represent real numbers we may assume—by 
diminishing r, if necessary—that r < a 

Finally, we get from (3.12) and Lemma 3.9 that 


1 1/2 i 
g(x) = 5 “El “ 7 


g(x) = 


g(x) = 


_ 1 (-1)""! (2m = 1) sees 
= aa 4n-1 ne Je aye 


n>1 
ys" 2(n — 1) a> 1 2n a 
n\ n-1l n+1\n 
n>1 n>0 
Hence, letting C, = a ) for n > 0, it follows from x 29(x) = = f(x) = 
ene anx" that 
1 2(n — 2) 

= = Cy- 

an ics -( a, ) n—2 


forn > 3. 
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At this point, rigorously speaking we do not know whether or not the above 
formula does solve the problem under consideration, for, in order to obtain 
it, we assumed the convergence of the power series )7,,.; dnx” in some open 
interval centered at 0. Yet in another way, since we cannot guarantee a priori the 
convergence of eS a,x" in some interval of the form (—r,r), with r > 0, the 
whole process above is nothing but a heuristic reasoning that made us infer that 
Cn—2 18, most likely, the correct value for a,. Nevertheless, in order to conclude it 
suffices to show that, letting a; = 0, a2 = 1 and a, = Cy_-2 forn > 3, we have 
An = >) 4-1 GkAn—K+1 for every n > 3. 

For what is left to do, let us start by showing that the power series 7,9) Cux” 
converges in some open interval centered at 0, which is quite easy: since 


1 2n xz 2n <2" an 
n+1\n/J" \n/J~— 


for n > 0, it suffices to apply Proposition 3.7 to conclude that the series converges 


in the interval (- a> 1). Thus, letting 


g(x) = ox _ 2 a ; i GE 


n>0 n>0 


and reverting the previous steps, we conclude that 


1—/1—4x 


g(x) = = 


and, hence, that nee)? — g(x) + 1 =0. Therefore, letting f(x) = x g(x), we get 


f@= Cn42x" = So anx” 


n>2 n>1 


and f(x)* = x(f (x) — x’). Hence, 


(Soanx") 


n>1 n>1 


2 


an equality that, once expanded, furnishes for n > 3 the desired recurrence relation, 


n 
an =) dean ei: 
k=1 


Thus, this shows that the sequence (a,),>1 such that a, = Cy—2 forn > 3 is, 
indeed, the only solution to our problem. Oo 
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Yet with respect to the previous example, we say that 


1 /2 
a ( "| (3.15) 
n+l1\n 


n > O, is the n-th Catalan number.’ Since a, = Cy—2, n > 3, satisfies the 
recurrence relation (3.14), it is immediate to verify that the sequence (C,)n>0 
satisfies the recurrence relation 


n—-1 


C= Ori (3.16) 
k=0 


which is then known as Catalan’s recurrence relation. Problem 5 brings another 
combinatorial situation modeled by Catalan’s recurrence. 

We close this chapter by using generating functions to present an alternative 
proof to Example 5.27 of [8]. In order to do so, we shall need to recall two analytical 
facts, collected below. 


% 1 (—1)"7! 
Lemma 3.15 The series )°,. 1 =z and ye a converge. Moreover, 


n>1 n 
(-1)y""! 1 1 
» n2 = 2 » n° 
n>1 n>1 


Proof The convergence of the series is quite standard, and can be found in most 
Calculus or Introductory Analysis texts (see, for instance, Example 7.39 and 
Proposition 7.48 of [8]). For what is left to do, it suffices to observe that, if 


— 1 
s= Dei ae then 


(—1)"7! 1 1 1 1 
» n2 =a 2) apa sept a= 5 


n>1 n>1 k>1 


The proof of the second result we shall need is considerably more refined than 
those of the other Calculus results used so far. Therefore, we shall limit ourselves 
to state it, referring the interested reader to Theorem 8.2 of [33] or (for a guided 
approach) to problem 7 of Section 11.3 of [8]. 


Lemma 3.16 Assume that the power series )~,.) 4nx" converges on the interval 
(—1, 1). If the number series )~,,..9 Gn also converges, then 


4 After Eugéne Catalan, Belgian mathematician of the nineteenth century. 
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We can finally present the promised example. Apart from some details, we follow 
Chapter 17 of [23]. 


Example 3.17 Let n > 1 be an integer and A = {a}, a2,...,a,} be a set of n 
positive integers, such that the sums of the elements of any two of its nonempty 
subsets are distinct. Prove that 


1 1 1 
—+—4+:::-+— <2, 
a\ a2 an 
Proof We first claim that it is enough to prove that }7y_, x < 2. Indeed, this being 


the case, let A’ = AU 2, with 2% > a, +---+ dy. The choice of N guarantees 
that any two nonempty subsets of A’ also have distinct sums of elements, so that 


For what is left to do, if 0 < x < 1, then, in the notations of the paragraph that 
precedes Problem 12, page 22, the stated conditions assure that 


k 
a SS PO 214) 2 = — 


DASCA jel 


Taking natural logarithms and invoking the result of Problem 3, page 78, we 
successively get 


k 


Y > log(1 agit 2 = [on( l=) 
i=l 


and 


x04 aa 
+ aoe P< ge te, 
3 = 2 3 


Y(#-5 


i=l 
for 0 < x < 1. Dividing both sides above by x, we get the inequality 
k 


> a1 a - xia! ee x ‘ x? . 
x farsa a me pas af aes 
2 3 i 2 3 : 


i=1 


which is valid on all of [0, 1). 
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Now, fix x € (0, 1) and integrate the last inequality above on the interval (0, x) 
to get 


k x : p2ai-1 3-1 x tp 
i + es arf l+>+y5+4+-:: }dt 
» ( 2 3 ~ Jo ( oS 


or, according to Problem 4, page 79, 


k 1 2a iy 3 2 eS x2 ds x3 i 
a a; XxX eee sx oy 32 see, 


t 


At this point, the simultaneous application of the first part of Lemma 3.15 and 
of Lemma 3.16 guarantee that, as x — 1-—, the limits of both left and right hand 
sides above do exist and can be computed termwise. Hence, by taking those limits, 
we conclude that 


ae eae 25 dee a 
ji 2° ~ 2 3 


Finally, the second part of Lemma 3.15 furnishes 


L 


Problems: Sect. 3.3 


1. Use generating functions to solve the recurrence a, = 2 and ays.) = ag +(k+1), 
fork > 1. 

2. The sequence (d,)n>0 is given by dg = | and dy4.1 = 2a, +n forn > 0. In order 
to compute a, in terms of n, do the following items: 


(a) Assuming that a, < a”, conclude that a,+1 < ot th provided a” (a — 2) > 
n. Then, show that a, < 3” for every n > 0. 
(b) Show that the generating function of (d;)n>o converges in the interval 


1 1 ws 1—2x+2x2 
(-4, s) and is given by f(x) = eee 


(c) Find real constants A, B and C such that 
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1 — 2x + 2x? A B C 
= + + 
(—x)2(1-2x) (l1—x)?  1-x  1-2x 


(d) Expand each summand of the right hand side above as a power series to 
conclude that a, = 2”t! — (n + 1) forn > 0. 


3. Given k,m e€ N, use generating functions to compute the number of integer 
solutions of the equation a} +a2+---+a, =m, such that a; > 1 forl <i <k. 

4. Use generating functions to compute the number of nonnegative integer solutions 
of the equation a; + a2 + a3 + a4 = 20, satisfying a, > 2 and a3 <7. 

5. A particle moves on the cartesian plane in such a way that from point (a, b) it can 
go to either (a+ 1, b) or (a, b+ 1). Givenn €N, let a, be the number of distinct 
ways the particle has to go from Ao(0, 0) to A,(n, 2), without ever touching a 
point (x, y) situated above the bisector of odd quadrants (i.e., one such point for 
which y > x). In this respect, do the following items: 


(a) Let Ag(k, k), with 0 < k <n. Prove that there are exactly ayay,_—1~, distinct 
trajectories for the particle in which Aj, is the last point (before A,) on the 
line y = x. 

(b) Conclude that a, = = dkdyn—1— and, hence, that a, = C, forn > 1, 
where C,, is the n-th Catalan number. 


For the coming problem, the reader may find it convenient to read again the 
paragraph that precedes Example 1.15. 
6. Forn € N, we let a, denote the number of partitions of n in natural summands, 
none of which exceeds 3. The purpose of this problem is to compute a, as a 
function of n, and to this end do the following items: 


(a) Show that, for |x| < 1, one has 


—_ 1 
ae = d—xd—x4)(0 — x3) 


n>1 


(b) Find a, b, c,d € R for which 


1 _ a as b re c nN d 
(1—x)(1 — x2)(1 — x3) ~ (d—-x)3) (l—-x)2 1—-x?) 1-x3° 


(c) Conclude that 


+2 
2 
+2 


) 
2) 
) 
) 


= 
n 


(n+1)+;5, if6|n 


= 
n 


(n+1)+4, if2|n but 3}n 
+2 
2 


+2 
2 


= 
n 


+3 
+3 
+t th, if2tn but3|n_ 
+4 


~~~ 
= 
n 


Ae ARE AR Ae 


(n+1), if 2,34n 
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Let a, count a certain number of configurations in terms of n € N. Up to 
this point, we have tried to explicitly compute a, with the aid of the ordinary 
generating function ee anx" of the sequence (an )n>0. Nevertheless, there are 
some instances in which is more adequate to invoke the exponential generating 
function of (a,)n>0, i-e., the power series 


The remaining problems present a few such situations. 

7. As in Example 2.5, let d, denote the number of distinct derangements of J). 
The purpose of this problem is to use the exponential generating functions 
ys dh yn to compute d, as a function of n. To this end, do the following 
items: 


(a 


wm 


Given integers n > 3 andl <k <n —1, let (a1,...,a,) be a derangement 
of {1,2,...,} for which a, = k. 


i. If ax =7n, prove that there are exactly d,—2 such permutations. 
ii. If a, #n, prove that there are exactly d,—1 such permutations. 


(b) Conclude from (a) that d, = (n — 1)(dy—1 + dy—z2) for n > 3. 

(c) Imposing that dy < kla* for every k <n and somea > 0, show that d, < 
nia” if @ > 2. Then, conclude that the series ae, ueygh converges on the 
interval (— 5 5) and, hence, defines a differentiable function f : (— 5 5) > 
R. 

(d) Use the result of (b) to show that (1 — x) f’(x) = x(f(x) + 1). 

(e) Since f(0) = 7 by taking an adequate real number 0 < r < 5 if needed, 
we can assume that f(x) > 0 in (—r, r). Conclude that, in (—r, 7), one has 


Zi (f@)+)= i 
PP = — 


(f) Deduce from (e) that f(x) = —-1+ h-e°. 
(g) Use (3.1), (3.8) and Proposition 3.3 to obtain 


n 


r= (ESP) 


no) 10 


(h) Finally, compute d, as a function of n with the aid of (g) and Corollary 3.6. 


In the previous problem, the convenience of the use of the exponential 
generating function of (dn)n>2 was mostly due (cf. item (c)) to the fact that 
n>? oh yn converges at some interval centered at 0, whereas )°,., d,x” does 
not. Nevertheless, in the coming two problems, the involved combinatorics will 
compel us to use exponential, instead of ordinary, generating functions. 
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8. We want to compute, in terms of n € N, the number s, of sequences of n terms, 
all of which equal to 0, 1, 2 or 3 and having an even number of 0’s. To this end, 
do the following items: 


(a) Show that 


n!} 
Sh y 7 a ee 
(2ao)!a1!a2!a3! 


the sum extending over all nonnegative values of ao, a1, a2, a3 such that 
2a9 +a, +a. +43 =n. 

(b) Set so = 1 and conclude that eee na is the exponential generating 
function of 


2 4 2 


joa (Ge Selle beg en): 


(c) Show that f(x) = 5(e** + eo), and hence that s, = 5(4" + 2"), 


9. One can also use exponential generating functions to compute the number of 
surjective functions between two finite sets (cf. Example 2.6). To this end, given 
m,n € N and finite sets A and B such that |A| = m and |B| = n, do the 
following items: 


(a) Show that the number of surjections f : A — B is equal to the number of 
ways of distributing m distinct objects among n distinct boxes, in such a way 
that no box remains empty. 

(b) Conclude that the number of surjections f : A — B is given by 


y m! 


aj!az!...dy!? 
the above sum extending over all n-tuples (a1, ...,d,) of positive integers 
such that aq] +---+ada, =m. 
n 
(c) If g(x) = (x + a + a +.- -) , use the result of Problem 1, page 78, to 
show that the term corresponding to x” in the expansion of g is given by 


pew x x 
te ee ; 
ai! ay! ay! 


an 


the above sum extending over all n-tuples (a,,...,a,) of positive integers 
such that aj +---+ a, =m. Conclude that the number of desired surjective 
functions is equal to the coefficient of “ in the expansion of g. 

(d) Use (3.8) to show that 


g(x) = (e* _ iy" = > (;) (-1)Fe@-)*, 
k=0 
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Then, expanding e“—)* in power series with the aid of (3.8), conclude from 
item (c) that the number of surjections we want to compute is equal to 


ct({) (n—k)”. 
LOM, 


Chapter 4 m®) 
Existence of Configurations sei 


Generally speaking, Combinatorics deals with two different kinds of problems: 
those in which we want to count the number of distinct ways of making a certain 
choice and those in which we want to make sure that some configuration does 
appear. In order to solve a problem of the first kind above, we employ, among 
others, the counting techniques discussed in the previous chapters. For the second 
kind of problem, up to this moment we have not developed any idea that could be 
systematically used. It is our purpose in this chapter to remedy this state of things. To 
this end, we start by discussing the famous pigeonhole’s principle of Dirichlet, along 
with several interesting examples. Then we move on to some applications of the 
principle of mathematical induction to the existence of configurations. The chapter 
continues with the study of partial order relations, exploring Mirsky’s theorem on 
the relation between chains and anti-chains. We close the chapter by explaining how, 
in some situations, the search for an adequate invariant or a semi-invariant can give 
the final outcome of certain seemingly random algorithms. 


4.1 Pigeonhole’s Principle 


In its simplest version, pigeonhole’s principle, also known as Dirichlet’s princi- 
ple,! can be stated as follows. 


Proposition 4.1 (Pigeonhole’s Principle I) [f we distribute n pigeons into n — 1 
cages (the pigeonholes), then at least one cage will necessarily contain at least two 
pigeons. 


‘After the German mathematician of the nineteenth century Gustav L. Dirichlet. Actually, we 
shall have to wait until Sect. 7.2 (cf. Lemma 7.7) to appreciate the original problem that motivated 
Dirichlet to introduce the pigeonhole principle. 
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Proof By contraposition, if each one of the n — 1 cages was to contain at most one 
pigeon, then we would have at most n — | pigeons. Oo 


As naive as it seems, the pigeonhole principle allows us to establish astonishing 
consequences, as shown by the coming examples. In all that follows, we shall always 
assume that the relation of getting acquainted to someone else is symmetric, i.e., if 
A knows B, then B also knows A. 


Example 4.2 There are n guests in a party. Show that we can always find two of 
them who, within the party, are acquainted with the same number of people. 


Proof Firstly, note that each of the n guests is acquainted with at least 0 and at most 
n — | other ones within the party. Therefore, there are two cases to consider: 


(i) Each guest knows at least one other person in the party: take n — 1 rooms, 
numbered from | to n — 1, and put in room i all of the guests (if any) who 
know exactly i other guests. Since we have n — | rooms (the pigeonholes) 
and n guests (the pigeons), Dirichlet’s principle assures that at least one room 
will contain at least two people. By the way we have allocated the guests in 
the rooms, these two guests know, within the party, the same number of other 
people. 


(ii) There exists at least one “guest” who, actually, has no acquaintances within the 
party: then, no guest knows all of the other n — | people, so that, in this case, 
we can number the rooms from 0 to n — 2 and reason as in (i). Once more, 
pigeonhole’s principle guarantees that at least one of the rooms will contain at 
least two people. Also as in (i), these two people know, within the party, the 
same number of other people. 

oO 


Example 4.3 (IMO Shortlist) Each subset of the set {1,2,..., 10} is painted with 
one, our of 1 possible colors. Find the greatest possible value of m for which one 
can always find two distinct and nonempty sets A, B Cc {1,2,..., 10}, such that A, 
B and A U B are all painted with the same color. 


Solution Let X = {1,2,..., 10}. Given ten colors C,, C2, ..., Cio, let us paint the 
k-element subsets of X with color Cx, for 1 < k < 10. This way, given two distinct 
and nonempty subsets A and B of X, there are two possibilities: either A and B 
have different numbers of elements, and hence were painted with different colors, 
or A and B have the same number of elements; in this second case, A U B has more 
elements than A and B, and thus was painted with a color different from that of A 
and B. Therefore, we conclude that n < 9. 
Now, take only nine colors, and let A; = {1,...,7} for 1 <i < 10. Since 


Ai CA. C::: C Ato 


is a chain of ten distinct and nonempty subsets of X (the pigeons) but we have 
only nine colors (the pigeonholes), Dirichlet’s principle assures the existence of 
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indices 1 <i < j < 10 such that A; and A; are painted with a single color. Since 
A; UA; = Aj, it suffices to let Aj = A and A; = B to fulfill the stated conditions. 
Thus, the largest possible value of n is 9. oO 


There are several interesting generalizations of the pigeonhole principle, and the 
coming result collects the one who is probably the most important of them. For what 
is to come, recall (cf. the paragraph preceding Problem 7, page 41) that the integer 
part of a real number x, denoted |x], is the greatest integer which is less than or 
equal to x. Thus, if |x] =n € Z, thenn < x < n+ 1. It is frequently useful to 
write down such inequalities as 


[Ix] <x < |xJ4+1. (4.1) 


Proposition 4.4 (Pigeonhole’s Principle II) [fn pigeons are distributed among k 


different cages, then at least one cage will contain at least [se | + 1 pigeons. 


Proof Arguing once more by contraposition, let us suppose that each of the k cages 
contains at most pigeons. Then, altogether we have at most k ee | pigeons, 
and it suffices to Sa that, by the left inequality in (4.1), 


n—1 n—1 
k <k =n—-l1<n. 
k k 


Let us examine a few other examples. 


Example 4.5 Prove that, in any group of twenty people, at least three of them were 
born in the same day of the week. 


Proof Let the twenty people be the pigeons and the 7 days of the week be the cages. 
Then, associate a person to a day of the week if he/she was born in that day. The 
second version of the pigeonhole principle guarantees that at least one cage will 


20-1 


contain at least | 2-4 | + 1 = 3 people. Thus, these three people were born in the 


same day of the an Oo 


Example 4.6 (Leningrad*) Each 1 x 1 square of a 5 x 41 chessboard is painted 
either red or blue. Prove that it is possible to choose three lines and three columns 
of the chessboard so that the nine | x 1 squares into which they intersect are painted 
with the same color. 


Proof Since we have used only two colors, the second version of the pigeonhole 
principle implies aia = each column (of five | x 1 squares), one e of the two colors 


Former name of the Russian city of Saint Petersburg. 
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of the 41 columns, take note of its dominant color. Since one out of the two colors 
dominates in each of the 41 columns, by invoking once more the second version of 
the pigeonhole principle we conclude that one of the two colors is the dominant one 
in at least [4 + 1 = 21 columns. Assume, without loss of generality, that such 
a color is red, and call a column as red if red is its dominant color. Then, choose 21 
red columns and, in each of them, choose three | x | red squares. 

Notice that there are exactly () = 10 possible ways of choosing three of the 
five 1 x 1 squares of each column. Therefore, out of the 21 red columns (and once 
again from the second version of the pigeonhole principle), the three | x 1 chosen 


we 21-1 = 
red squares occupy exactly the same positions in at least | aa | + 1=3 columns. 


In other words, we have at least three columns having | x 1 red squares in the 
same set of three lines, and this is exactly what we were asked to prove. Oo 


The next example uses an obvious generalization of the pigeonhole principle 
for the case in which we have a finite number of cages and an infinite number of 
pigeons. 


Example 4.7 (Bosnia) A set A of natural numbers is said to be good if, for some 
n €N, the equation x — y = n has infinitely many solutions (x, y) with x, y € A. 
If N = A; U A2 U---U Ajgo, prove that at least one of the sets A; is good. 


Proof By the first version of Dirichlet’s principle, out of each set of 101 natural 
numbers (the pigeons), at least two will belong to a single one of the 100 sets Aj, 


A2,..., Aigo (the cages). In particular, for each k € N, at least two of the 101 natural 
numbers 101(k — 1) + 1, 101(k — 1) + 2,..., 101k, which we shall call x; and yx, 
belong to a single one of the 100 sets Aj, A2,..., A1oo. 


Assume, without loss of generality, that x, > yx for each k. This way, we 
obtain infinitely many pairs x} > y1, x2 > yo, ...of natural numbers satisfying 
the following conditions: 


(i) the two numbers in each pair are in one of the sets A}, A2,..., A100; 
(ii) 1 < xg — yg < 100 for eachk > 1; 
Gili) i> fj SX > Yi > Xj > Yj. 


Looking at the integers k > 1 as the pigeons and at the 100 sets Aj, A2,..., A1oo 
as the cages, we conclude (by the obvious generalization of the pigeonhole principle 
alluded to above) that there exist an index 1 < m < 100 and an infinite set B C N 
such that 


i€ B> xj, yj € Am. 
We claim that A, is a good set. Indeed, since x; — yj € {1,2,3,..., 100} for 
each i € B, a further application of the obvious generalization of the pigeonhole 
principle guarantees the existence of an integer | < n < 100 and an infinite set 


C C B such that 


ieC>x-y=n. 
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However, since i € C => xj, yj € Am, we conclude from (iii) that the equation 
x — y =n admits infinitely many solutions in A,,. Oo 


As our next application of Dirichlet’s principle, we shall prove a particular case 
of a famous result of the Hungarian mathematicians Paul Erdés and Esther Szekeres 
(cf. [15]), which is known in mathematical literature as Erdés-Szekeres theorem 
(for another proof of it, see Sect. 4.3). 


Example 4.8 (Erdés-Szekeres) Out of any sequence of n? + 1 distinct real numbers, 
one can always find a monotone subsequence of at least n + | terms. 


Proof Let A = {x1,x2,...,X,24,} be the set whose elements are the n2 +1 
terms of the given sequence, and assume that the given sequence does not possess 
any increasing subsequence with at least n + 1 terms. Then, every increasing 
subsequence starting at a certain x € A has at most n terms. Define 


f:An- {1,2,...,n} 
by letting f(x) = size of the largest increasing subsequence of the given sequence, 


starting at x. Since A has n? + 1 elements, the pigeonhole principle guarantees that 
we should have at least 


2 
1)-1 
je | oy eee es | 
n 
elements of A with the same image, say X;,, Xi),...,Xi,,;, With ij < ig <--- < 


intl. If xi, < xj, for some 1 < j </ <n+1, we would have f %i;) > f (xi,), for 
we could enlarge an increasing sequence starting at x;, by placing x;, before it. But 
since f (x; 9) > f(xi,) 1s impossible, we conclude that 


Xiy > Xin ee ae > Xingy: 


Thus, we have found a decreasing subsequence of n + 1 terms. oO 


We end this section by presenting four additional examples which illustrate the 
wide versatility of applications of Dirichlet’s principle. 


Example 4.9 (South Korea) For each positive integer m, prove that there exist 
integers a and b such that |a|, |b] < m and 


1 
0<at+bv2< + v2 
m+ 


2 
7" 


Proof Let A be the set of positive reals of the form a+ b./2, with 0 <a,b<mand 
a and b not both equal to 0. Since x; + yyv2 =x2+ yoV/2, with x1, x2, y1, y2 € Z, 
implies x; = x2 and yj = y2, we conclude, with the aid of the fundamental principle 
of counting, that A has exactly (m + 1)? — 1 = m? + 2m elements, the largest one 
being m(1 + V2). 
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Leta = nee and J; = (VG — la, ja], with 1 <j < m? + 2m. The union of 
the /;’s is the interval (0, m(1 + /2)], which contains A. If one of the elements of 
A belongs to J, there is nothing left to do. Otherwise, there will be left m?+2m—1 
intervals to contain all m? + 2m elements of A, and the Dirichlet principle assures 
that at least one of these intervals, say J,, contains at least two elements of A, say 
xy yyv2 and x2 + y2ovV/2, with xj + yyv2 <xo+ yoV/2. Now, since J; has length 
a, we get 


0 < (x2 — x1) + G2 — yi) 2 = (2 + yoV2) — (21 + 1 V2) < 


Finally, noticing that |x2 — x1|, |y2 — y1| < m, we conclude that it suffices to let 
a=x2—x, andb= y2— yj. oO 


The next example uses the existence part of the Fundamental Theorem of 
Arithmetic (Theorem 6.43), together with the notation of congruence modulo 2 
(cf. Example 2.21). 


Example 4.10 (IMO) Let A be a set of 1985 positive integers, none of which has 
a prime divisor larger than 26. Show that it is possible to find four elements of A 
whose product is a fourth perfect power. 


Proof The given conditions assure that every x ¢€ A is of the form x = 
QM1x 32x | 23%, with a1, a2, ..., a9 € Z4. Modulo 2, the number of distinct 
9-tuples (a1, ..., @9) is equal to 2? = 512 (for each exponent aj is either odd or 
even). Since 1985 > 512, pigeonhole’s principle assures that we can choose x1, y1 € 
A such that a;,, = ajy,(mod 2) for 1 < i < 9. Hence, ax, + a@jy, = O(mod 2) for 
1 <i < 9, so that x1y) = ae for some z; € N. Since 1985 — 2 > 512, by 
invoking Dirichlet’s principle once more, we can choose x2, y2 € A \ {x1, y1} such 
that x2y2 = za, for some z2 € N. Continuing this way, we obtain pairwise distinct 


: 2 
elements x1, y1, x2, Y2,.-.,%513, y513 € A, with xjyj = Le for some z; € N and 
every | <i < 513. 
Now, since the prime factorisations of z1, Z2,..., 2513 also contain only primes 


less than 26 and 513 > 512, an additional application of the pigeonhole principle 
guarantees the existence of indices 1 < k <7 < 513 such that zz; = w’, for some 
w € N. Hence, 


Doe 2 22 4 
XKVEXIV) = 2p = (Zz) = (Cw) = w"” 


oO 


Example 4.11 (IMO) Let x1, x2,...,X, be real numbers for which ae + ac ferret 
a = |. Prove that, for every integer k > 2, there exist integers a1, a2, ..., dn such 
that at least one of which is nonzero, |a;| < k — 1 for 1 <i <n and 


(k—DJa 


Jaixi + apx2 + +++ + Gn%nl S S| 
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Proof Firstly, if (a, a2,..., 4G) is one of the k” sequences of integers satisfying 
0 <a; <k-—1forl <i <n, then Cauchy’s inequality (cf. Section 5.2 of [8], for 
instance) gives 


Now, subdivide interval [0, (k — 1)./n] into k” — 1 disjoint subintervals, 
(k=DJn_ 


each of which having length —,=;—. Since there exist k” distinct sequences 
(a1, 2,...,4,) of integers, pigeonhole’s principle guarantees that at least two of 
such sequences, say (bj, b2,..., bn) and (ci, c2,..., Cn), are such that both of the 
sums )~_, bjx; and )~"_, cjx; belong to a single one of those k" — 1 subintervals. 
Hence, 


Do, —j)x)| = oy = Der|= < ee 


and it suffices to let aj = b; — c; for 1 <i <n. oO 


Our final example is a problem with two parts, in which only the easiest one 
makes use of Dirichlet’s principle. Nevertheless, insight from such a use gives a 
valuable clue for an effective approach to the second part. 


Example 4.12 (EKMC) Let k > 3 and n be integers such that n > (5 y Prove that if 
aj, bj, c; (1 < i <n) are 3n distinct real numbers, then one can find at least k + 1 
distinct numbers among the 37 sums a; + bj, a; + c;, bj + c;. Show also that this 
claim is not necessarily true ifn = (*), 


Proof By contradiction, assume that we have at most k distinct numbers among the 
3n sums aj + bj, a; + ¢;, bj + C7, SAY X1, X2,.--, XK. 

Since a; + b;, aj + cj; and b; + c; are pairwise distinct, (5) is the number of 
3-element subsets of {x,,.x2,...,xg} andn > (5), we conclude that there exist 
1 <i < j <n for which 


{a; + bj,a; +¢;,b; +o} = {aj + bj, a; +¢;,b; + cj}. 


But it is not difficult to verify that such an equality would imply that at least one 
element of {a;, bj, c;} would be equal to at least one element of {a;, bj, cj}, which 
is an absurd. 

For a counterexample when n = (5). we want to have only k distinct numbers 
among the 3m numbers a; + b;, aj + cj, bi} + cj, even though the 37 numbers a;, 
b;, ci are still pairwise distinct. To this end, first note that, by arbitrarily choosing k 
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distinct real numbers x1, x2,...,.x%, we can formn = (5) subsets of {x1, x2,..., Xx}, 
of three elements each, and choose an arbitrary bijection between the sets 


k 

[ai + bia: +¢,b)4+¢; lSisn= (3)| 

and 
{xy tx, +x; 1<r<s<t<k}. 

Once we have done that, we would have, for 1 <i < j <n, 

ai t+bj =Xq;, ai +c = Xp, Di +c; = Xy, 
and 

aj + bj =Xaj> aj + Cj = Xg;; bj + Cj = Xy;, 

where 1 < aj, a@;, Bi, Bj, Yi, vj < & are such that a;, B;, yj; are pairwise distinct, 
the same happening with a;, Bj, y;. 

Facing each of the two 3-tuples of equalities above as a linear system of 
equations, we can compute aj, b;, c; (resp. aj, bj, cj) in terms of Xq;, xg, Xy; (resp. 
Xap» XB» Xy; ), getting three distinct numbers in each case (for xq,, xg;, Xy; resp. 
Xj» Xpj> Xyj;—are pairwise distinct). 

Therefore, the central point to the construction of a counterexample is that we 
ought to have 

{aj, bi, ci} N {aj, bj, cj} =G 
or, which is the same, 
(Xeq + Xp) — Xyj1 Xa + Xy;, — XB» XB; + Xj — Xay} 
and 
(Xa; + XB; — XyjoXeey + Xyy — Xj %Bj + %yj) — Xeej} 
to be disjoint. 
It is thus enough to show that it is possible to choose x1, x2, ..., x% in such a way 


that 


Xp t+ X53 — Xp F Xy + Xy — Xw 
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whenever r, s, ¢ and uw, v, w are two 3-tuples of distinct integers of the interval [1, k], 
with {r,s,t} A {u, v, w}. Since we may have t € {u, v} or w € {r,s}, a possible 
choice is to take 


: x = 23, x3=2°,..., ee ee 


4H 2 
The uniqueness of binary representation of naturals (cf. Section 4.1 of [8], for 
instance) assures that the above conditions will be satisfied, even if we have 
t € {u, v} or w € {r,s}. oO 


Problems: Sect. 4.1 


1. We mark at random five points inside a square of side length 2. Show that it is 
always possible to find two of these five points whose distance from one another 
is at most V2. 

2. Show that if we choose 800 points inside a cube of edge length 10, at least one 
of the line segments determined by two of these points will have length less 
than 2. 

3. Let x € R and n be an integer greater than 1. Show that, among the numbers x, 
2x,...,(n — 1)x, there is at least one whose distance from an integer is at most 


SIR 


The coming problem brings yet another useful version of the pigeonhole 
principle. In words, it says that at least one of a number of given pigeonholes 
will contain at least as much pigeons as their average value. 

4. * We are given n pigeonholes and a positive integer m. We place a, pigeons in 
the first pigeonhole, az pigeons in the second, ..., a, pigeons in the n-th. If 


a 4a2++++ + an 
n 


> (resp. >) m, 


prove that at least one of the pigeonholes will contain at least, m (resp. m + 1) 
pigeons. 

5. The natural numbers from 1 to 15 are distributed around a circle. Show that it 
is always possible to find five consecutive numbers whose sum is greater than 
or equal to 40. 

6. We are given two equal disks A and B, each of which is divided into 200 equal 
sectors, which, in turn, are painted either black or white. In disk A, there are 
100 white sectors and 100 black sectors, in an unknown order. In disk B, we do 
not know how many sectors are white. Prove that it is possible to place disk A 
directly above disk B in such a way that at least 100 sectors of disk A lie above 
sectors of disk B, whose colors match theirs. 
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11. 


12. 


13. 


14. 


15. 


16. 


17. 


18. 


19. 
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The next eight problems use elementary facts on the notion of divisibility of 
integers. Before trying to solve them, the reader might find it convenient to run 
through the statements of the definitions and results of Chap. 6. 


. Prove that in any set of 52 integers, there exist at least two whose sum or 


difference is a multiple of 100. 


. Prove that every natural number has a nonzero multiple whose decimal 


representation contains solely the digits 0 and 1. 
(TT) Prove that every natural number has a multiple whose sum of digits is odd. 


. Show? that, if a and n are relatively prime natural numbers, then at least one of 


the numbers a, a*,...,a"—! leaves remainder 1 upon division by n. 

We are given a set A = {aj,a2,...,dn}, of n > 1 positive integers. Prove 
that there exist natural numbers k and / such that 1 < k < / <n and the sum 
ap + agai +--+ +a) is a multiple of n. 

Show that, by arbitrarily choosing n+ 1 elements from J, we shall necessarily 
choose two such that one divides the other. 

(Iran) Let A be a set of 33 natural numbers, each of which has all of its prime 
factors among the prime numbers 2, 3, 5, 7, 11. Prove that there exist two 
distinct elements of A whose product is a perfect square. 

(Russia) We have chosen 15 pairwise relatively prime elements from the set 
{1,2,..., 1998}. Prove that at least one of the 15 chosen elements is necessarily 
a prime number. 

Given a set A of ten positive integers of two digits each, prove that it is always 
possible to obtain two nonempty disjoint subsets of A having the same sum of 
elements. 

(India) A set of real numbers is free of sums provided no two of its elements 
(not necessarily distinct) are such that their sum equals another element of the 
set. Find the greatest number of elements that a free of sums set may have, 
knowing that it is a subset of A = {1,2,3,...,2” + 1}. 

Show that, by choosing n + | elements from the set {1,2,..., 37}, we have 
necessarily chosen two elements x and y such that xy + 1 or 4xy+ lisa 
perfect square. 

For each positive integer n, find the least positive integer f(m) such that, for 
every partition of the set {1,2,..., f(m)} into n sets, there exist integers a > 0 
and 1 <x < y for whicha+x,a+yanda+x-+y all belong to a single one 
of the n sets of the partition. 

(USA) 


(a) Each | x 1 square of a 3 x 7 chessboard is painted either white or black. 
Prove that, independently of the way we have painted the squares, there 
will always be four squares of the same color, which are the corner squares 
of a rectangle with sides parallel to those of the chessboard. 


Let y(n) denote expression (2.7), deduced at Problem 6, page 40, and let a € Z be relatively 
prime with n. In Sect. 10.2, we shall prove a theorem of L. Euler which assures that a?) always 
leave remainder 1 upon division by n (in the language of Sect. 2.3, a? = 1(modn)). 


4.2 Induction and Existence of Configurations 105 


(b) Exhibit a way of painting the squares of a 4 x 6 chessboard so that the 
condition of item (a) is never satisfied. 


20. (IMO) Seventeen people discuss one of three possible subjects by letter. More 
precisely, each two of the seventeen people exchange a letter in which exactly 
one of the three possible subjects is touched upon. Prove that there are three of 
them who exchange letters on the same subject. 


4.2 Induction and Existence of Configurations 


In this section, we examine some examples that illustrate how mathematical 
induction can be used, in combinatorial situations, to establish the existence of 
configurations possessing certain properties. We assume from the reader a thorough 
acquaintance with the principle of induction, referring to Section 4.1 of [8] for 
an elementary exposition of the necessary background. In particular, most often 
we shall not write formal inductive proofs, i.e., sometimes initial cases will not be 
checked and induction hypotheses will not be explicitly stated. 


Example 4.13 (Germany) Given n € N, prove that one can assemble a square of 
side length 2” by using one square piece of side length 1 and several L-triminoes, 
i.e., pieces of the shape below, where each small square has side length equal to 1: 


Proof We shall use induction on n to show that each square of side length 2” can be 
assembled as required, with the square piece of side length 1 occupying a corner of 
the 2” x 2” square. 

For n = 1, the assembling is immediate, as shown in the figure below: 


By induction hypothesis, assume that every square of side length 2‘ can be 
assembled as prescribed. For the inductive step, take a 2+! x 2+! square and 
divide it into four squares of side length 2‘, as shown in the figure below. Then, 
place an L-trimino in the center of the larger square, so that it occupies one square 
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of side length 1 in three of the four squares of side length 2‘. Finally, place the 1 x 1 
square piece in the corner of the 2‘t+! x 2*+! square that belongs to the square of 
side length 2 which was not intersected by the L-trimino already placed. 


The placement of these two pieces has the effect of letting each of the four 
squares of side length 2 with exactly one corner occupied by a square piece of 
side length 1. Hence, induction hypothesis assures that we can finish the assembling 
of each one of these four squares by using L-triminoes only, and taking these lets us 
conclude that the same is true for the 2+! x 2'+! square (with the square piece of 
side length 1 placed in one of its corners). oO 


As one should expect, the coming example shows that, sometimes, strong 
induction is in force. 


Example 4.14 (Leningrad) List all nonempty subsets of the set {1, 2, ...,} which 
do not contain consecutive elements. For each one of them, compute the square of 
the product of its elements. Prove that the sum of all of these numbers is equal to 
(n+1)!—1. 


Proof As was anticipated above, we shall make strong induction on n > 1. 

If n = 1, the only nonempty subset of {1} is itself, so that the required sum is 
1=2!-1. 

By induction hypothesis, suppose that, for 1 <n < k, the desired sum is equal 
to (n+ 1)! — 1. Now, notice that the nonempty subsets of {1, 2,...,k + 1} without 
consecutive elements can be divided in two categories: 


(i) the nonempty subsets of {1, 2, ..., &} without consecutive elements; 
(ii) the sets of the form AU {k+ 1}, where A is a subset of {1, 2,..., k — 1} without 
consecutive elements. 


The induction hypothesis guarantees that the sum of the squares of the products 
of the elements of the sets of type (i) equals (k + 1)! — 1, while that of the sets of 
type (ii) equals (A + 1)2(k! — 1) + (k + 1)? (the summand (k + 1)? corresponding 
to the set {k + 1} = @U {k + ]}). 
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Hence, the sum of the squares of the products of the elements of all desired 
subsets of {1,2,...,& + 1} equals 


(kt D!—14(k+1°R!-D+Kk41% = 
=(k+)!—14+(k+1)’-k! 
=(kK+1I!—-1+ K+ D!1K+D) 
=(k+ IMk+2)—-1 
=(k+2)!—1. 


Oo 


We now give two examples that show that an inductive argument can compose 
just part of the analysis of a certain combinatorial situation. 


Example 4.15 (Russia) Letn > 1 be an odd integer. In open field, n children are so 
positioned that, for each one of them, the distances to the other n — 1 children are 
pairwise distinct. Each child has a water pistol and, at the sound of a whistle, fires 
at the child closest to him/her. Show that at least one child will remain dry. 


Proof Firstly, let us look at the case n = 3. Let A, B and C be the children and 
assume, without loss of generality, that AB < BC < AC. Then, A fires at B and B 
at A, so that C remains dry. 

Now, assume that whenever 2k — 1 children (k > 1) are positioned in open field 
as prescribed in the statement of the problem, at least one of them remains dry. We 
then consider 2k + | children, also positioned as required by the problem. Since the 
distances between pairs of children are pairwise distinct, there exist two children, 
say A and B, such that the distance between them is the smallest one of all distances 
between two of the given children. Thus, A fires at B and vice-versa. Discarding A 
and B, we are left with 2k — 1 children, and there are two distinct possibilities: 


(i) One of these 2k — 1 children fires at A or B: in this case, at most 2k — 2 shots 
are fired towards some of the remaining 2k — 1 children (except A and B). By 
the pigeonhole principle, at least one of them remains dry. 

(ii) None of the remaining 2k — 1 children fires at A or B: by induction hypothesis, 
at least one of these 2k — | children remains dry. 
oO 


Example 4.16 (IMO Shortlist) We wish to write one of the numbers 0, | or 2 in 
each entry of a table of 19 lines and 86 columns, in such a way that the following 
conditions are satisfied: 


(a) Each column has exactly k zeros. 
(b) For any two chosen columns, there exists a line such that the two entries of this 
line situated at the chosen columns are, in some order, equal to | and 2. 


Find all values of k for which this is possible. 
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Solution We say that a set of m lines distinguishes n columns if, for any choice of 
two of these n columns, it is possible to choose one of the m lines in such a way that 
the entries of this line situated in the two chosen columns are equal to 1 and 2, in 
some order. 

We claim that m lines distinguish at most 2” columns. Indeed, it is clear that 
a single line can distinguish at most two columns. Now, assume that j lines 
distinguish at most 2/ columns. Then, since a single line distinguishes at most 
two columns, j + 1 lines will distinguish at most two groups of 2/ columns (the 
distinction made by the (j + 1)-th line between the two groups of 2/ columns is a 
permutation of the pattern 


Vee 222s 
—_—Oor eo” 
QI QI 
Hence, j + 1 lines will distinguish at most 2/ + 2/ = 2/+! columns. 

Back to the given table, since 86 > 2°, the above claim assures that we need 
at least 7 lines to distinguish all of the 86 columns. On the other hand, 7 lines will 
suffice, for 86 < 27. Actually, it suffices to choose 86 distinct sequences of 7 terms, 
each of which equal to 1 or 2, and fulfill the first 7 lines with the terms of these 86 


sequences. The remaining lines can be randomly fulfilled. 
Therefore, we must have 0 < k < 19—7= 12. oO 


For the coming example, recall that, givenn € N, we let J, = {1,2,...,n}. Also 
recall that, given sets A and B, their symmetric difference AAB is given by 


AAB = (AUB)\(ANB). 


Example 4.17 (Argentina) Givenn € N, let P, be the family of all subsets of J,. If 
Ff : Pn — In is any function, show that there exist distinct A, B € P, and such that 


f(A) = f(B) = max(AAB). 
Proof Forn = 1, we have f : {@, {1}} > {1} and @ A{1} = {1}, so that 
f@) = fl} = 1 = max({1}). 


Suppose that the statement is true for n = k, and take a function f : Pe41 > 
Ix. There are two cases to consider: 


(i) f maps all sets in Px to elements of Jj: in this case, it suffices to apply the 
induction hypothesis to the restriction of f to Ix. 

(ii) There exists A € Px such that f(A) = k+ 1: let F C Pry be the family of 
the elements of Px containing k + 1, ie., 


F ={XU{k+ 1}; X © Py}. 
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We look at two subcases: 


¢ there exists B ¢ F such that f(B) = k+ 1: then, k +1 € B \ A, so that 
A# B,k+1¢€ AAB and 


f(A) = f(B) =k+1= max(AAB). 


e f maps into J: let g : Py — I, be given by setting g(X) = f(XU{k+1}), 
for every X € Px. By induction hypothesis, there exist distinct A’, B’ € Px, 
such that 


¢(A’) = 9(B’) = max(A’ AB’). 


Since (A’U{k+1}) A(B’U{k+1}) = A’AB’, we conclude that f (A’U{k+1}) 
and f (B’ U {k + 1}) are both equal to max ((A’ U{k+ 1) ACB’ U {kK + 1})), 
with A’ U{k + 1} 4 B/U{k + 1}. 

Oo 


We finish this section by revisiting Problem 17, page 31, this time with the aid of 
mathematical induction. 


Example 4.18 (Saint Petersburg—Adapted) We are given 2n + | points on a circle, 
such that no two of them are the endpoints of a diameter. Prove that, among all 
triangles having three of the given points as their vertices, at most 


n(n + D)Qn +1) 


of them are acute. 


Proof For n = 1 there is nothing to do, since three points on a circle determine 
exactly one triangle. 

By induction hypothesis, assume that the problem is true when n = k — 1, for 
some integer k > 2. Then, letn = k, so that we have 2k + | points on the circle. 
Out of them, take all pairs of points such that the chord joining them leaves k points 
in one arc and k — 1 points in the other arc of the circle. Among all such pairs of 
points, let A and B be two such that AB is at minimum distance from the center O 
of the circle (if there is more than one pair of points with this property, choose any 
one of them). 

Now, there are three kinds of acute triangles having vertices at three of the 2k + 1 
given points: 


(i) Those having A and B as two of their vertices: since O must belong to the 
interior of such a triangle, and AB leaves k points at one side and k — 1 points 
at the other side, there are at most k such triangles. 

(ii) Those having A or B, but not both of them, as one of their vertices: let C and 
D be the other two vertices of such an acute triangle (so that the third one is 
either A or B). We consider two subcases: 
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¢ C and D are at the same side of AB: if ACD was to be acute, then AC or 
AD would leave B and O at different half-planes. In turn, this would assure 
that the distance from AC or AD to O would be less than the distance of 
AB to O, thus contradicting our choice of A and B. Therefore, AC D is not 
acute and, accordingly, the same is true of BCD. 

¢ C and D are in opposite sides with respect to AB: since AC BD is convex, 
at most one of the triangles AC D or BC D contains the center O, and hence 
at most one of such triangles is acute. On the other hand, by the choice of A 
and B the set {C, D} can be chosen in exactly k(k — 1) ways, and this is the 
largest possible number of acute triangles of this kind. 


(iii) Those having neither A nor B as one of their vertices: by induction hypothesis, 
there are at most a (k — 1)k(2k — 1) such triangles. 


Adding the contributions of the three cases above, we conclude that the number 
of acute triangles having its vertices at three of the 2k + 1 given points is at most 


k+k(k—1)+ tk ash) 6) ae Fk(k + 1)(2k +1). 


Problems: Sect. 4.2 


1. (India) Prove that, for every n > 6, every square can be partitioned into n other 
squares. 

2. (Brazil) Given a natural number 1 > 1, we write a real number of modulus less 
than | in each cell of ann x n chessboard, in such a way that the sum of the 
numbers written in the four cells of any 2 x 2 square is equal to 0. If n is odd, 
show that the sum of the numbers written in the n? cells is less than n. 

3. (TT) Point O is situated in the interior of the convex polygon Aj A2... An. 
Consider all of the angles 4A;O Aj, with distinct 1 < i, j < n. Prove that at 
least n — 1 of them are not acute. 

4. (IT) In a convex polygon P some diagonals were drawn, such that no two of 
them intersect in the interior of P. Show that there are at least two vertices of P 
such that none of the traced diagonals is incident to none of these two vertices. 

5. (Hungary) In ann x n chessboard we have a certain number of towers (at most 
one tower for each | x 1 square). A tower can move from one square to another 
if and only if these two squares belong to a single line or column of the board 
and there are no other towers in between. On the other hand, if two towers 
belong to a single line or column and there are no other towers in between, then 
we say that any of these towers can attack the other. Prove that one can paint the 
towers with one of three colors in such a way that no tower can attack another 
tower of the same color. 


4.3 


10. 
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We are given n distinct lines in the plane. Prove that it is possible to paint the 
regions into which these lines divide the plane of black or white, so that if two 
regions share a common edge, then they have distinct colors. 


. (TT) Each square of a chessboard is painted either red or blue. Prove that the 


squares of one of these two colors are such that a chess queen can move through 
all of them (possibly passing more than once through one or more of the | x 1 
squares of this color), without visiting a single square of the other color. We 
recall that a chess queen can move an arbitrary number of squares along any 
line, column or diagonal of the chessboard. 


. Weare given 7 points in the plane, not all of which being collinear. Prove that: 


(a) There exists at least one line that passes through exactly two of these n 
points. 
(b) The 7 points determine at least n distinct lines. 


The result of item (a) is known as the Sylvester-Gallai theorem,’ while that of 
item (b) is due to Erdés and de Bruijn.? 

In the plane we are given 2n points in general position, i.e., such that no three 
of them are collinear. If n of the given points are blue and the other n are red, 
prove that it always possible to draw n line segments satisfying the following 
conditions: 


(a) Each line segment joins a blue point to a red point. 
(b) They do not intersect each other. 


(OCS) Prove that, given a positive integer n, there exists another positive integer 
k, with the following property: given any set of k, points in space, no four of 
them being coplanar, and associating an integer number from | to 7 to each line 
segment joining two of these k, points, one necessarily gets a triangle having 
vertices at three of the k, points and such that the numbers associated to its 
sides are all equal. 


4.3 Partially Ordered Sets 


This short section discusses a few facts on partially ordered sets, focusing on the 
theorems of Mirsky® (cf. [30]) and Dilworth’ (cf. [14]). Asa corollary of Mirsky’s 
theorem, we obtain another proof of Erdés-Szekeres theorem. Dilworth’s theorem, 
in turn, will be applied to Graph Theory in Problem 25, page 134. 


4James Sylvester, English mathematician, and Tibor Gallai, Hungarian mathematician, both of the 
twentieth century. 


Nicolaas de Bruijn, Dutch mathematician of the twentieth century. 


® After Leon Mirsky, Russian mathematician of the twentieth century. 


7 After Robert Dilworth, American mathematician of the twentieth century. 
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We start by presenting the concept of partially ordered set, together with some 
examples. In the coming definition, < denotes a relation on the set A, i.e., a subset 
of the cartesian product A x A. 


Definition 4.19 We say that a nonempty set A is partially ordered by <, or that 
(A, <) isa partially ordered set, if the following conditions are satisfied, for all 
a,beA: 


(a) (Reflexivity) a < a. 
(b) (Antisymmetry) a <x b andbx<xa>a=b. 
(c) (Transitivity) a < bandbxc>aXxc. 


In this case, we also say that < is a partial order relation in A. 


In the notations of the previous definition, <x must be thought of as a way of 
comparing elements of A. This being said, note that we are not demanding that < 
provides a way of comparing any two elements of A; in other words, it may well 
happen that there exist a,b € A such that a < b and b Z a (ie., such that a < b 
and b ~ a are both false). If this is so, then we say that a and b are incomparable 
(via <); otherwise, a and b are said to be comparable. 

If < is a partial order relation on A for which any two elements of A are 
comparable, we say that < is a total order relation, or simply a total order. We 
also say that (A, <) is a totally ordered set. 

It is time that we take a look at a few relevant examples of order relations. We 
refer to Problem | for details. 


Example 4.20 


(a) The set of real numbers is totally ordered by the order relation <, defined by 
axb<a <b. In this case, we say that < is the usual order in R. 

(b) Let X be a nonempty set and F = P(X) be the family of all subsets of X. For 
Y,Z € F (e., Y,Z C X), we define Y <x Z © Y C Z. It is immediate to 
verify that (F, <) is a partially ordered set, which is usually denoted simply by 
writing (P(X), C). In this case, we say that C is the inclusion relation in X. 

(c) Given a, b €N, recall that a | b (one reads “a divides b’’) if there exists c € N 
such that b = ac. Letting < be defined on N by a x b S a | DB, it is also 
straightforward that (N, <) is a partially ordered set. In view of this example, 
one frequently says that N is partially ordered by the divisibility relation. 

(d) Let A be a nonempty set partially ordered by x. If B C A is nonempty, the 
restriction of < to elements of B turns it into another partially ordered set. In 
this case, we say that B is partially ordered by the order relation induced from 
A. 


If (A, X) is a partially ordered set and a,b € A, we write a < b to mean that 
a <x banda S b. As we indicated before, we write a £ b to mean that a < bis 
false. For what we have in mind, the coming definition is central. 
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Definition 4.21 Let A be a nonempty set ordered by <. With respect to <, a 
nonempty subset B of A is: 


(a) A chain if, for every a, b € B, we have either a < borb <a. 
(b) An antichain if, for every a, b € B, we havea < bandb Ka. 


In words (and in the above notations), B is a chain if, furnished with the order 
relation induced from A, is a totally ordered set. On the other hand, B is an antichain 
if any two of its elements are incomparable by x. 


Example 4.22 Let N be partially ordered by the divisibility relation. Letting P = 
{2,3,5,7, 11, ...} denote the set of prime numbers, it will be proved in Sect. 6.3 
that P is infinite. Thus, P is an infinite antichain on N, whereas {2, 2-3, 2-3-5,...} 
is an infinite chain. 


Example 4.23 If X = {aj,a2,...,@m} is a set with m elements, then, ordering 
F = P(X) with respect to inclusion, the family 


G = {@, {ay}, {a1, a2}, (a), a2, a3} ..., {4), a2, ..., Am}} 


is achain. Actually, F does not contain longer chains. In order to prove this assume, 
by the sake of contradiction, that 


C={A1 C Ar C... C Am42} 


is a chain on F, formed by m + 2 distinct subsets of X. Then, since |A;| € 
{0,1,2,...,m} for 1 < i < m4 2, the pigeonhole principle assures that there 
exist 1 <i < j < m+ 2 for which |A;| = |Aj|. In turn, together with Aj; C Aj, 
this gives Aj; = Aj, which is an absurd. 

On the other hand, it is immediate to verify that, for each integer 0 < k < m, the 
family 


Fx ={¥ CX; |Y| =k} 
is an antichain in F (note that Fy = {G}). Furthermore, note that 
P= Fo UF .c.U Fy 


We are finally in position to state and prove Mirsky’s theorem, which is a direct 
generalization of the situation described in the above example. For the proof of it, 
if (A, X) is a partially ordered set and B = {a; < az X ... X dp} is a chain in 
A formed by m distinct elements, then we say that B has length m and a,, is its 
maximum element. 


Theorem 4.24 (Mirsky) Let (A, <) be a partially ordered set. If A does not 
possess chains of length n + 1, then A can be written as the union of at most n 
antichains. 
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Proof Let us make induction on 1, noticing firstly that, if A does not possess chains 
of length 2, then A is itself an antichain, and the theorem is trivially true. By 
induction hypothesis, suppose that each partially ordered set which does not posses 
chains of length k can be written as a union of at most k — 1 antichains. 

Let (A, <) be a partially ordered set which does not possess chains of length 
k + 1. Define B as the subset of A formed by the maximum elements of the chains 
of maximum length in A. We first claim that B is an antichain with respect to x. By 
contradiction, suppose that there existed x, y € B such that x < y. Since x € B, we 
can find a chain C in A, of maximum length and whose maximum element is x; but 
then, C U {y} would be a chain in A, with maximum element y and longer than C, 
which is an absurd. 

Now, note that (A \ B, <) does not have chains of length k. Indeed, if C was 
such a chain in (A \ B, x), then, since A does not possess chains of length k + 1, the 
chain C would be one of maximum length in A. Therefore, the maximum element 
of C would belong to B, which is a contradiction. 

By applying the induction hypothesis to A \ B, we conclude that it can be written 
as the union of at most k — | antichains. Finally, adjoining B to such a collection of 
antichains, we write A as the union of at most (k — 1) + 1 = k antichains. oO 


As we mentioned before, we now obtain the Erd6ds-Szekeres theorem as a 
corollary of Mirsky’s theorem. Actually, the following version generalizes that of 
Example 4.8. 


Theorem 4.25 (Erdés-Szekeres) Ifa, b € N andn = ab+1, then every sequence 
(x1,...,%Xn), of distinct real numbers, either possess an increasing subsequence of 
a+ 1 terms or a decreasing subsequence of b + 1 terms. 


Proof Define A = {(i, xj); 1 <i <n} and let < be the relation in A given by 
(i, xi) X (j, xj) @ @= J) or G@ < j and x; < xj). 
It is immediate to verify that < is a partial order in A, and that 
(i1, Xi,) < (2, Xin) ~ +++ K (ik, Xix) 


is a chain in A if and only if (%,,...,xj,) am increasing subsequence of 
(x1,...,%X,). Accordingly, for 1 <i) <i2 <--- <ig <n, 


{(i1, Xi), (iz, Xin), «- +, ik, Xi, )} 


is an antichain in A if and only if (x;,,..., xi,) is a decreasing subsequence in 
(x1, ..., Xp). Indeed, since i, < is for 1 <r <s < k and all of the x;’s are distinct, 
we get 


(i;, Xi,) 4 (is, Xi,) > Xi, < Xi, is false = Xi, > Xis- 
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Now, assume that (x1, ..., Xn») does not contain an increasing sequence of a + 1 
terms. Then, A does not have chains of length a+1, so that Mirsky’s theorem assures 
that A can be written as the union of / < a antichains, say A,,..., Ay. However, 
since 


I I 
ab+1=n=|Al=|UJAil < Ail, 


i=l i=l 


it follows from the pigeonhole principle that, for some index 1 < i < J, we must 
have 


b 
aie [Sl +1e 041, 


Problems: Sect. 4.3 


1. * Check that all of the relations defined in Example 4.20 are, indeed, order 
relations. 

2. Let A be a set of n? + 1 positive integers. Prove that there exists a subset B of A, 
with n + 1 elements and satisfying one of the following conditions: 


(a) Foralla,b € B, eithera | borb|a. 
(b) For alla, b € B, onehasa{ bandb{a. 


3. State and prove a generalization of the Erdés-Szekeres theorem for a partially 
ordered finite set. 

4. Let be given m,n € N and an m x (n2" + 1) table of pairwise distinct real 
numbers. Show that it is possible to choose n + | columns of the table such that 
the m x (n + 1) table composed by the chosen columns satisfies the following 
condition: the numbers of each line, written from left to right, form an increasing 
or decreasing sequence. 

5. In a finite partially ordered set, let A and B be two chains having the greatest 
possible lengths. If AM B # GJ, prove that A and B have equal lengths. 

For the next problem, if (A, <) is a partially ordered set, we say that a € A is 
a maximal element of A if the following condition is satisfied: 


axx,xEeAS>a=x. 


6. * The purpose of this problem is to Dilworth’s theorem on partially ordered 
sets: if a finite partially ordered set (A, <) does not contain antichains of n+ 1 
elements, then A can be written as the union of n pairwise disjoint chains. To this 
end, and arguing by induction on |A|, do the following items (cf. [19]): 
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(a) Let a be a maximal element of A and assume, by induction hypothesis, that 
A \ {a} contains an antichain Bo of k elements and can be written as the 
union of k pairwise disjoint chains A;, ..., Ax. Conclude that Aj N Bo 4 @ 
forl <i<k. 

(b) For 1 <i <k, let x; € A; be the maximum element of A; belonging to an 
antichain of k elements in A \ {a}. If B = {x,, x2,..., xx}, show that B is 
an antichain. 

(c) Assume that x; < a forsome 1 <i <k. 


i. Let C = {a} U{y € Aj; y X x;}. Show that A \ C contains an antichain 
of k — 1 elements but does not contain an antichain of k elements. 

ii. Apply the induction hypothesis to A \ C and, then, conclude that A can 
be written as the union of k pairwise disjoint chains. 


(d) Assume that x; X a for every | <i < k. Show that: 


i. BU {a} is an antichain of k + 1 elements. 
ii. A can be written as the union of k + 1 pairwise disjoint chains. 


4.4 Invariants 


In the combinatorial situations we discuss here, we generally consider algorithms® 
involving somewhat random choices of entries, and try to study the set of possible 
outcomes after a certain number of iterations. In this sense, a frequently fruitful 
approach consists in trying to associate an invariant to the algorithm, ie., a 
mathematical object that behaves in a predictable way along each iteration of it, 
independently of the chosen inputs. 

Since there are no general rules that teach us which invariant should be associated 
to each particular algorithmic situation, we shall limit ourselves here to examine 
some interesting examples, in order to help the reader to grasp the general idea and 
to prepare him/her for the posed problems. 


Example 4.26 There are several + and — signs written on a blackboard. At each 
second we can replace two signs by a single one, according to the following rule: 
if the two erased signs are equal, we replace them by a + sign, whereas if they are 
different, we replace them by a — sign. Show that, when just one sign is left on the 
blackboard, it will not depend on the order we made the replacements. 


Formally, an algorithm is a finite sequence of well defined operations that, once performed on 
some (more or less) arbitrary set of data (called the input of the algorithm), furnish a definite result, 
called the outcome or output of the algorithm. On the other hand, each such performance of the 
algorithm is generally referred to as an iteration of it. We shall encounter algorithms several times 
along these notes. 
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Solution By the sign rule for products, the product of the signs written on the 
blackboard is an invariant for the described replacement algorithm. Thus, the final 
sign on the blackboard will be equal to the result of the multiplication of all of the 
+ and — signs initially written and, hence, does not depend on the order of the 
replacements. Oo 


Example 4.27 (Bulgaria) There are 2000 white balls in a box. Also, there is a 
sufficient supply of white, green and red balls outside the box. The following 
replacement operations are allowed with the balls lying inside the box: 


(a) Two white balls by a green one. 

(b) Two red balls by a green one. 

(c) Two green balls by a white and a red ones. 

(d) One white ball and one green ball by a red one. 
(e) One green ball and one red ball by a white one. 


After a finite number of these replacement operations, only three balls were left in 
the box. Prove that at least one of them is green. Does there exist a finite number of 
operations leaving just one ball in the box? 


Solution Assume that, in a certain moment, we have x white balls, y green balls 
and z red balls in the box. Let us look at how the quantity x + 2y + 3z behaves when 
performing one of the allowed operations: 


* (a): x + 2y + 3z does not change; 

¢ (b): x +2y + 3z changes tox + 2(y +1) 4+3(2 —2) =x+2y+3z-4; 

* (c):x+2y + 3z changes to (x + 1) + 2(y — 2) +3(2+ 1) =x+2y +4 3z; 

e (d): x +2y + 3z changes to (x —1)+2(Qy—-1 +34) =x+2y + 3z; 

° (e): x + 2y + 3z changes to (x + 1) + 2(y—-D+3(z- 1) =x+2y4+3z-4. 


According to the analysis above, the remainder of the division of x + 2y + 3z 
by 4 is an invariant for the described replacement algorithm. Since we began with 
x = 2000 and y = z = 0, we conclude that x + 2y + 3z is always a multiple of 4. 

Now, note that with only three balls in the box, we have x + y + z = 3 and 
x +2y+3z = 4k, for some k € N. If y = 0, then x + z = 3 and x + 3z = 4k; but, 
then, 4k = x +3z = x +3(3 — x) = 9 — 2x, which is an absurd. Therefore, we 
must have y > 1, so that there will be at least one green ball in the box. 

On the other hand, if at some moment we had only one ball in the box, we should 
have x +y+z= landx+2y+3z = 4k. The first equation shows that exactly one 
of x, y and z is equal 1, while the other two are equal to 0; but these possibilities 
are clearly inconsistent with the second equation. Therefore, we will never reach a 
situation in which just one ball is left in the box. Oo 


In the following, we analyse more sophisticated examples. 


Example 4.28 (Soviet Union) At first, we have n > | real numbers written in a 
blackboard. An allowed operation is to erase two of them, say a and b, and to write 
we in their place. Thus, after each operation, the total quantity of numbers written 


in the blackboard is one less than it was before the operation has been executed. 
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If all of the 1 initial numbers were equal to 1, prove that, when just one number is 
left on the blackboard, it will necessarily be greater than or equal to i. 


Proof Suppose that, at some moment, we have the numbers x1, x2,..., Xx41 written 
on the blackboard. At this moment, associate the number 


1 1 1 
FQ, .--, Xk) = —+t—+---+ — 
X{ X2 Xk+1 


to the blackboard. Without loss of generality, assume that we erase xz and x;z41, 
writing x, = Sit fet in place of them. Since 


4 1 1 
< 


< + 
Xk + Xk+1 Xk Xk+1 


(cf. Example 5.1 (b) of [8], for instance) it is clear that 


Ff Oty sig et) SF Os vee A): 
Hence, f(x1,..., Xx) iS a semi-invariant for our problem (i.e., the order relation 
between the values of f(x1,..., xx), computed before and after each operation, 


is an invariant for the problem—in our case, such values for a nonincreasing 

sequence). Thus, when a single number x will be written on the blackboard, we 
ought to have 

1 

n=fd,1,...,) > f@=-. 


x 


In turn, this is the same as having x > i. oO 


Example 4.29 (Putnam) Let A be the total number of distinct sequences (a1, a2, 
..., 410) Of positive integers such that 


Decide whether A is even or odd. Justify your answer. 


Solution Firstly, let us prove that the equation 


1 1 1 

—+—+.-- —_— =] 

X12 X10 
has an even number of solutions xj = aj, ..., X10 = 410 in positive integers, 
satisfying the additional condition a; 4 ap. Indeed, if x1 = a), x2 = a2, x3 = a3, 
-e+5 X10 = 410 is sucha solution, then X] = A2,X%2 = Q1, X3 = 43,..-., X10 = 410 1s 


another one, and vice-versa. Therefore, we can group the above solutions in pairs, 
so that there is an even number of them. 
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Thanks to the above argument, in order to compute the parity of the number of 
solutions of the given equation in positive integers, it suffices to compute the parity 
of the number of such solutions satisfying the additional condition a, = ap (here is 
where the idea of invariance plays a role: the parity of the number of solutions in 
positive integers does not vary if we impose that a, = a2). 

Analogously, it suffices to compute the parity of the number of solutions in 
positive integers satisfying the conditions a3 = a4, a5 = a6, a7 = ag and ao = ajo. 
In turn, this being the case, we get 


eee, (4.2) 


Following the same reasoning as above, we conclude that there is an even number 
of solutions of (4.2) in positive integers and such that aj ~ a3, as well as an even 
number of such solutions in which as # a7. Therefore, in order to compute the 
parity of the total number of solutions in positive integers, we can assume that a) = 
a3 and as = a7. The given equation now reduces to 


pf ee, (4.3) 


By repeating the same reasoning once more, we conclude that the parity of the 
total number of solutions is the same as that of the total number of solutions of (4.3) 
in which a, = as, i.e., the parity of the total number of solutions of 


4 1 1 


ay ag 
The last equation above is equivalent to 


a ae, ee (4.4) 
a, —8 a, —8° , 


ag = 


so that aj — 8 = 1, 2, 4, 8 or 16. 
Hence, we have exactly five positive integer solutions for (4.4), and the invariance 
of parity assures that A is odd. Oo 


Example 4.30 (Belarus) A cube lies on an infinite grid of 1 x 1 squares in such a 
way that one of its faces (say F) is situated exactly above one of the squares of the 
grid. The cube begins to roll on the grid, passing from one position to the next by 
rotating over one of its edges. At some moment, the cube stops exactly on the same 
1 x 1 square where it began to move. Then, one notices that the face of the cube in 
contact with the grid is F again. Is it possible that, this time, F is rotated by 90° 
with respect to its initial orientation? 


Proof It is not possible! In order to prove this, let us paint the vertices of the grid 
alternately black and white, as in ordinary chessboard. When the cube is about to 
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start rolling, F has two black and two white vertices; then, paint the remaining four 
vertices of the cube also black and white, in such a way that each of its edges joins 
two vertices of distinct colors. It is easy to check that, as the cube rolls on the grid 
from one position to the next, black (resp. white) vertices of it are always in contact 
with black (resp. white) vertices of the grid. (Here is the invariance!) However, if 
at some future moment F revisits its original position rotated by 90° with respect 
to the initial orientation, then we would have two black vertices of F exactly above 
two white vertices of the grid. This contradiction proves our initial claim. Oo 


Our last example uses a semi-invariant reasoning to construct an optimal 
configuration. 


Example 4.31 (IMO Shortlist) 155 birds are standing on a circle of center O. Two 
birds A and B are said to be mutually visible if and only if AOB < 10°. Assume 
that more than one bird can stand on a single point. Compute the smallest possible 
number of pairs of mutually visible birds. 


Solution Assume that at least one bird stands at a point A, and at least another 
one stands at a point B, where A and B are distinct points of the circle such that 
AOB < 10°. Suppose, further, that 4 birds are visible from A but not from B, 
whereas k birds are visible from B but not from A. If h < k and we let all birds in B 
to fly to A, then the total number of pairs of mutually visible birds (obviously) does 
not increase. By repeating such an operation several times, we obtain at the end a 
configuration in which two birds are mutually visible if and only if they are placed 
at a single point. Moreover, since the total number of pairs of mutually visible birds 
did not increase along the performed operations (here we have the semi-invariant!), 
in order to minimize this number, it suffices to consider such configurations. Also, 
since # - 360° = 10°, a further consequence of this reasoning and the pigeonhole 
principle is that we ought to have birds in at most 35 distinct points of the circle. 

Now, take 35 points along the circle, labelled 1, 2, 3,..., 35, and place x; > 0 
birds at the point 7. Our problem reduces to minimizing 


35 


Xj 1 = 
>»: (5) = 5 xii — D, 


i=l i=l 


under the constraint ye , xi = 155. It thus suffices to minimize 5 ee under 


the condition sy x; = 155. To this end, first note that if 1 <i < j < 35 are such 
that x; — x; > 1, then 


(x7 +%5) — (a — 1)? + (yj +17) = 2G; — xj — 1) > 0. 
Therefore, a minimizing placement of the birds must be such that the values of the 


x;’s must differ by at most 1, from where we can let x} = ... = xg = a and 
Xk] =... = X35 =a+ 1. This way, we get 
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35 
) x7 = ka? + 35 — kat 1)? = 35(a + 1)" - Qa + Dk. 
i=l 


Finally, since ka + (35 — k)(a + 1) = 155 or, which is the same, 35a — k = 120, 
it is immediate to check that the minimum possible value of pa xP is obtained for 
a = 4and k = 20. Hence, the above discussion assures that the minimum possible 
value of mutually visible birds is 


3 15+ 20 = 270 
2 2 oe 


Remark 4.32 Yet with respect to the solution above, let n € N be given and x1,..., 
x, be nonnegative integer variables. The argument just presented, of minimizing 
>-7_, x? subjected to the constraint )°?_,.x; = m, is an example of discrete 
optimization and is quite important in Combinatorics. It will make its appearance 
again, in disguised form, in Problems 26 and 27, page 135, as well as in Problem 3, 
page 142. 


oO 


Problems: Sect. 4.4 


1. (OCM) One has six equal bottles, five of them containing 21 of water each and 
the sixth one containing 1]. At any time, an allowed operation is to choose two 
bottles and split the total amount of water they contain into two equal parts (by 
pouring water from one bottle to the other). Is it possible to reach the situation 
in which every bottle has the same amount of water? 

2. (Soviet Union) At the beginning of a class, a teacher writes a second degree 
trinomial on the blackboard. At later moments, an allowed operation with the 
trinomial written on the blackboard, say ax* +bx +c, consists in replacing it by 
cx* + bx +a (if c £ 0) or by any trinomial of the form a(x +1)? +b(« +t) +c, 
with t € R chosen at random. If the trinomial initially written was x7 — x — 1, is 
there any sequence of allowed operations that makes the trinomial x? + 3x — 1 
appear in the blackboard? 

3. We cut out two | x | squares situated in opposite corners of an 8 x 8 chessboard. 
Is it possible to cover the 62 remaining squares with 31 rectangles 2 x 1? Justify 
your answer! 

4. (Argentina) Firstly, numbers 1, 2, 3, ..., 1998 are written on a blackboard. 
An allowed operation is to replace two of the numbers written, say a and b, by 
|a—b|. After 1997 operation, there is just one number written on the blackboard. 
Prove that, regardless of the order in which the replacements have been made, 
this last number will always be odd. 


122 


5. 


10. 
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A Mathematics teacher posed the following activity to his students: one of them 
would write on the blackboard a list of six integer numbers; then, a second 
student would choose three of these six numbers, say x, y and z, replacing 
them respectively by x — y — z, 3x — 3y — 2z and 4x — 2y + 4z (thus getting, 
immediately below the former one—and by repeating the three numbers that 
have not been chosen—a new list of six integer numbers on the blackboard). 
Such a procedure would, then, be repeated with this new list of six numbers 
and another student. After the class, the blackboard was partially erased, there 
remaining only two lists os numbers: 


12345 6 and372 15 8 8 


(we don’t know how many lists there were between these two left ones). Prove 
that, at some replacement, a student made a mistake. 

(TT) Ten coins are placed around a circle, all of them showing heads (so that 
tails cannot be seen at the beginning). Two moves are allowed: 


(a) To turn down four consecutively placed coins. 
(b) To turn down four coins placed as in XXOXX (here, X represents a coin to 
be turned down, whereas O represents one that should stay as it is). 


Is it possible to have all ten coins showing tails after a finite sequence of moves? 


. (Brazil) In the parliament of Terra Brasilis, each member has at most three 


enemies. Prove that the parliament can be divided into two houses, in such a 
way that, within his/her house, each member has at most one enemy. 
(Leningrad) To begin with, numbers 1, 2, 3, ..., are written on a blackboard. 
At any subsequent time, if there are at least two numbers on the blackboard, 
an allowed operation consists of erasing two of them, say a and b, and writing 
a+ b+ ab instead (so that, after the operation is performed, we have one less 
number on the blackboard). After such an operation is carried out n — | times, 
we Shall have only one number written on the blackboard. Prove that it will not 
depend on the order the operations were executed. 
At each vertex of a square we have a certain quantity of cards (possibly none). 
An allowed operation is to remove a certain quantity of cards from one vertex 
and to put twice this quantity of cards in one of the two vertices adjacent to 
that one (to this end, there is a sufficiently large supply of cards outside the 
square, all of them ready to be used). Assume that we begin with just one card 
in one vertex, the other three vertices being empty. Is it possible, after a certain 
number of steps, to have exactly 1, 9, 8 and 9 cards in the vertices of the square, 
in clockwise order? 

The next problem revisits Problem 9, page 111, with the aid of invariants. 
In the plane we are given 2n points in general position, i.e., such that no three 
of them are collinear. If n of these points are blue and the other n are red, 
prove that it is always possible to draw n line segments satisfying the following 
conditions: 


4.4 


11. 


12. 


(b) 


13. 


14. 
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(a) Each line segment joins a blue point to a red point. 
(b) They do not intersect each other. 


A cube of edge length n is partitioned into n> unit cubes. Two unit cubes are 
said to be adjacent if they share a common face. Students A and B play the 
following game on the bigger cube: A starts in a unit cube of his/her choice, 
then moving to an adjacent unit cube. Then B, departing from this last unit 
cube, moves to a third one, adjacent to the one he departed from, and so on. A 
and B play alternately, without ever revisiting a unit cube visited before. If the 
first who cannot play loses the game, and if A and B play with the best possible 
strategies, who will be the winner? 

(IMO shortlist) 


(a) Decide whether or not it is possible to number the unit squares of an 8 x 8 
chessboard from | to 64 in such a way that, for every set of four unit squares 
of one of the shapes below, the sum of the numbers written on them is a 
multiple of 4: 


Po Ee 


Do the same for the sets of four unit squares of one of the following shapes: 


cP oh 


(Leningrad) All 120 faces of twenty equal cubes are painted black or white, 
with three faces of each color per cube. Prove that the cubes can be placed on 
an opaque table in such a way that the faces touching it form the board of a 
6 x 6 square and the number of black visible faces equals the number of white 
visible ones. 

For the coming problem the reader will find it useful to recall the following 
fact, to be proved later (cf. Proposition 6.33), on the greatest common divisor 
and least common multiple of two natural numbers: given a, b € N, one has 


ab = gcd(a, b) lcm (a, b). 


(TT) Several positive integers are written on a blackboard. An allowed oper- 
ation is to erase two distinct numbers and to write, in their places, their gcd 
and lcm. Prove that, after we have repeated such an operation a certain finite 
number of times the numbers written on the blackboard do not change any 
more, no matter which additional operations we perform. 


Chapter 5 ®) 
A Glimpse on Graph Theory spooks 


We begin this chapter by considering the following three combinatorial problems: 


Problem 1 in a party with 100 guests, there is always an even number of people 
who know, within the party, an odd number of other people. 


Problem 2 a country has 10 cities and 37 two-way roads, such that each road 
connects two distinct cities and each pair of cities is connected by at most one road. 
Under such conditions, is it always possible to use the roads to travel from one city 
to another (perhaps passing through at least one other city)? 


Problem 3 a regular polygon of 100 sides is drawn on the plane, together with all 
of its diagonals. Suppose we choose at random 2501 of its sides or diagonals, and 
paint them red. Is it necessarily true that a red triangle, having vertices at three of 
the vertices of the polygon, will be formed? 


In spite of the seemingly lack of correlation between the above problems, in all 
of them we have certain sets of “objects” (guests, cities, vertices), together with 
relations between them (knowing each other, being connected by a road, being 
connected by a red segment, according to the case). These common features will 
allow us to analyse the three problems within the same abstract context, namely, 
Graph Theory. There, we concentrate on the fact that two objects may or may 
not be related, without paying actual attention to the particular kinds of objects or 
relationships between them. 

It is the purpose of this chapter to develop the most basic aspects of Graph 
Theory, together with some interesting applications. Along the way, among others 
we shall prove Euler’s theorem on the characterization of the existence of Eulerian 
paths, Cayley’s theorem on the number of labelled trees and Turdan’s extremal 
theorem on the existence of cliques. Since we will barely touch the surface of Graph 
Theory, we refer the interested reader to [13, 22] or [40] for more comprehensive 
introductions to this subject. 
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5.1 Basic Concepts 


In this chapter, unless stated otherwise, all sets under consideration will be finite. 
Given a nonempty set A, we shall denote by ?2(A) the family of the subsets of A 
having two elements: 


P2(A) = {BC A; |B] = 2}. 


_ [lA 
#P2(A) = >) 


Definition 5.1 A (simple) graph is a pair G = (V; E), where V is a finite 
nonempty set and E C P2(V). The elements of the set V are called the vertices 
of the graph G, whereas the elements of the family F are called the edges of G. 


In particular, 


If there is any possibility of confusion, we may write V(G) and E(G) to denote 
the sets of vertices and edges of a graph G. 

If G = (V;; E) is a graph and u and v are two of its vertices, we shall say that u 
and v are adjacent (or neighbors) provided {u, v} € E; in this case, we shall also 
say that the edge {u, v} is incident to the vertices u and v. Whenever there is no 
danger of confusion, we shall denote the edge {u, v} simply by wv or vu. If u and v 
are not adjacent, we shall say that they are non adjacent vertices of G. 

For most purposes, it is quite convenient to represent a graph G = (V; E) by 
means of a diagram in which the elements of V are depicted as points or tiny circles 
in the plane and the edges of G as arcs joining the corresponding vertices. Well 
understood, the diagram thus obtained has no geometric meaning, and its purpose 
is nothing but to schematically represent the adjacency relations between pairs of 
vertices of G. For instance, if 


G = ({a, b, c, d}; {{a, b}, {a, c}, {a, d}, {b, c}}), 


then G may be represented by any one of the two diagrams shown in Fig. 5.1, for 
both of them embody the same adjacency relations. 

We shall have more to say on this when we study the notion of isomorphism for 
graphs. 


Fig. 5.1 Two representations b c 
of a graph 
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Example 5.2 Givena set V with n elements and a graph G = (V; E), Definition 5.1 
allows two extreme cases, namely, those in which EF = # or E = P2(V). In the first 
case, we say that G = (V; 9%) is the trivial graph on 7 vertices, which can be 
represented by a set of n points in the plane, with no arcs. In the second case, G is 
the complete graph (or a clique) on 7 vertices, which can also be represented by a 
set of 1 points in the plane, with any two of them being adjacent. From now on, we 
shall denote a complete graph on n vertices by K,,; in particular, note that K, has 
exactly (5) edges. 


Back to the notion of adjacency, fixed a vertex u of G = (V; E), we shall denote 
by NG (u) the set of vertices adjacent to u: 


Ng(u) = {ve V; uve E}. 


The following definition is central to all that follows. 


Definition 5.3 Let G = (V; E) be a graph. For u € V, the degree of u, denoted 
dg(u) is the number of vertices of G which are adjacent to u: 


dg(u) = |Neu)]. 


Remark 5.4 Whenever the graph G = (V; £) is understood, for u € V we shall 
denote the degree of wu and the set of neighbors of u simply by d(u) and N(u), 
respectively. 


We are finally in position to state and prove the most basic result of Graph Theory, 
which is due to Euler.! 


Theorem 5.5 (Euler) Jn every graph G = (V; E), the sum of the degrees of the 
vertices is equal to twice the number of edges. In symbols: 


2\E| = >> dglu). (5.1) 


ueV 


Proof We use double counting. It suffices to observe that, if € = {u, v} is an edge 
of G, then € is counted exactly twice in the right hand side of (5.1): once in the 
summand dg(u) and another time in the summand dg(v). Hence, the right hand 
side of (5.1) has to be equal to 2|#|, since this number also counts each edge of G 
exactly twice. Oo 


The coming corollary, also due to Euler, is the most important consequence of 
the previous theorem. 


Corollary 5.6 (Euler) In any graph, the number of vertices of odd degree is even. 


' Actually, as we shall see later, Euler is usually credited as being the founder of Graph Theory. 
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Proof Let G = (V; E) be a graph and, for an integer k > 0, let ug(G) denote the 
number of vertices of G with degree k. Since the sum at the right hand side of (5.1) 
has exactly v,(G) summands equal to k, we have (again by double counting) the 
equality 


Py dg(u) = + ku (G). (5.2) 


ueV k>0 


It thus follows from Euler’s theorem that 


2/E| = ) i) da(u) = D7) kue(G) 


ueV k>0 
= DlQj + Dv2j41(G) + D> 2jv2j(G) 
j20 jzl 
= Do 10j41(G) + >) 2F (v2j(@ + 02;41(G)). 
j=0 jzl 
Therefore, 
Yo v0j41(G) = 2IE| — > 2702) (G) + 1241(@), 
j20 jzi 
which is an even number. oO 


Example 5.7 Prior to a committee’s reunion, some of its ten members shook hands. 
Is it possible that the numbers of handshakes have been, in some order, equal to 1, 
1, 1, 3, 3, 3, 4, 6, 7 and 8? 


Proof Look at the committee’s members as the vertices of a graph, with two vertices 
being adjacent if the corresponding people shook hands. Then, if the situation just 
described could have occurred, the degrees of the vertices of the graph would have 
been, in some order, equal to 1, 1, 1, 3, 3, 3, 4, 6, 7 and 8. But this would contradict 
the previous corollary, since we would have an odd number of vertices of odd 
degree. Therefore, such a situation cannot have occurred. oO 


The following definition brings the appropriate notion of equivalence for graphs. 


Definition 5.8 Graphs G = (V,; E,) and H = (V2; E2) are said to be isomorphic 
if there exists a bijection f : Vj — V2 which preserves incidence, i.e., such that, 
for every two distinct vertices u and v of G, we have 


{u,v} € Ey > {fl), fv)} € Eo. 


In this case, we denote G; ~ Go. 
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It is clear from the previous definition that two isomorphic graphs have equal 
numbers of vertices. On the other hand, Problem 4 guarantees that, for each n € N, 
the notion of isomorphism of graphs induces an equivalence relation (cf. Sect. 2.3) 
in the set of graphs of 7 vertices. 

The coming proposition gives other necessary conditions for two graphs to be 
isomorphic. 


Proposition 5.9 Two isomorphic graphs have equal numbers of vertices of a 
certain degree. In particular, they have equal numbers of edges. 


Proof Let G = (V,; E,) and H = (V2; E2) be isomorphic graphs and f : Vi > V2 
be an incidence-preserving bijection. If u is a vertex of G having degree k > 0, 
such that Ng(u) = {u1,..., ux}, then it is immediate to see that Ny(f(u)) = 
{f(u4),..., f(ux)}. In particular, f(u) is a vertex of H having degree k. An 
analogous reasoning assures that if u has degree 0, then the same holds for f(u). 

Since the argument of the last paragraph is symmetric with respect to G and 
HT (here we are using the fact that the notion of isomorphism of graphs is an 
equivalence relation), we conclude that G and H have, for every integer k > 0, 
equal quantities of vertices of degree k. 

For what is left, we apply Euler’s theorem twice: since dg(u) = dy(f(u)) for 
every u € V; and f is a bijection between the sets of vertices of G and H, we obtain 


27Eil= D> dew) = Yo du(fW)= D> da(fW) = 2IFal. 


ucV, ueV, fUuyeVa 


Therefore, |E | = | £2|. oO 


Example 5.10 If G = (V; E) is a graph of n vertices we can assume, whenever 
needed, that V = J,. Indeed, since |V| = 1, we can choose a bijection f : V > I, 
and let H = (I,; F) be defined by setting, fori # j in Jy, 


fi feF o{f'@, f We E. 


The previous example allows us to introduce a quite useful algebraic representa- 
tion of a graph, namely, its adjacency matrix. In this respect, we assume the reader 
is acquainted with the most basic concepts on matrices, which can be revisited at 
Chapter 2 of [4], for instance. 


Definition 5.11 Given a graph G = (V; E), with |V| = n, assume, according to 
the previous example, that V = [,,. The adjacency matrix of G is the n x n matrix 
Adj(G) = (aj;) such that 


afb fi Fj and i ek 
J 10, else . 
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Lemma 5.12 The adjacency matrix of a graph is symmetric, with zeros in the main 
diagonal. 


Proof In the notations of the previous definition we have, for i # j in J,, that 
ajH1lotjpeEs{,ijek oa =l, 


hence, Adj(G) is symmetric. The rest is immediate from the definition. Oo 


Remark 5.13 Given a graph G = (V; E), we shall sometimes say that G is 
labelled, an allusion to the fact that its vertices have names, or labels. The notion 
of isomorphism of graphs allows us to introduce the concept of unlabelled graph. 
More precisely, if V is a set with n elements and G is the set of graphs having 
V as set of vertices, then the restriction of the relation of graph isomorphism to 
G induces an equivalence relation in G. A unlabelled graph with n vertices is 
an equivalence class in G, with respect to the isomorphism relation. Unless we 
explicitly say otherwise, in these notes we shall consider only labelled graphs, 
generally omitting the adjective “labelled”. 


We finish this section by examining the important notion of subgraph of a graph. 


Definition 5.14 A graph H is a subgraph of a graph G if V(H) C V(G) and 
E(H) C E(G). The graph H is a spanning subgraph of G if H is a subgraph of 
G for which V(H) = V(G). 


We now show how to build two fundamental examples of subgraphs of a given 
graph. 


Example 5.15 Given a graph G = (V; E) ande ¢€ E, the subgraph of G obtained 
by the edge excision of € is the graph H = (V; E\e). From now on, we shall denote 
such a subgraph of G simply by G — e. In words, G — € is the subgraph obtained 
from G by erasing the edge e€. Note, further, that G — € is a spanning subgraph of 
G. 


Example 5.16 Given a graph G = (V; FE) and u ¢€ V, the subgraph of G obtained 
by vertex excision of u is the graph H = (V \ {u}; E’), where 


E' = E \{e € E; € is incident with u}. 


From now on, we shall denote such a subgraph of G simply by G — u. In words, 
G — u is the subgraph of G obtained by erasing vertex u, together with all of the 
edges of G incident to it. Note, further, that G — u has one vertex less and dg (u) 
edges less than G. 


Corollary 5.6 gave a necessary condition for a list of nonnegative integers 
to be the list of degrees of some simple graph. Although there is no simple 
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sufficient condition to decide whether this is so, we finish this section by presenting 
Havel-Hakimi’s~ simple algorithm, that allows one to quickly decide it for any given 
list of nonnegative integers. We follow [22]. 


Theorem 5.17 (Havel-Hakimi) /f s is a positive integer, then the nonnegative 
integers 


S>th>thn>...2>t 2d, >...>dy 
are the degrees of the vertices of some simple graph if and only if so are 
ty —l,t2-—1,...,t; —1,d),...,dp. 


Proof Let G be a graph whose vertices have degrees s >t) >t2>...> ty > dj => 
. > dy. Since s is positive and the vertex u of degree s has exactly s neighbors, 
we conclude that ft}, fo,..., tf; = 1 (mote that this follows from the fact that we have 


listed the degrees in descending order). For 1 <i < s, let v; be the vertex of degree 
t; and, for 1 < j <n, let w; be the vertex of degree d;. We distinguish two cases: 


(i) Ng) = {v1,..., vs}: then the vertices of H := G — u have degrees ty — 
lto—1,...,t; —l,dy,..., dp. 
(ii) Neu) ¥ {v1,..., vs}: then, for some 1 <i <s andsome | < j <n, we have 


u nonadjacent to v; and adjacent to wj. In particular, t; => d;, and we consider 
two subcases (make some drawings to follow the reasoning): 


* t; = dj: let G’ be the graph obtained from G by exchanging v; and w;. 

* ¢; > dj: then v; has a neighbor x which is not a neighbor of w;. Let G’ be 
the graph obtained from G by erasing the edges {u, w;}, {v;, x} and adding 
the edges {u, v;}, {w;, x}. 


After having performed either i. or ii., we are left with a graph G’ whose 


list of degrees is equal to that of G, and such that Ng (u) M {vj,..., vs} 
strictly contains Ng(u) MN {v1,..., vs}. If Neu) = {v1,..., vs}, we return 
to i. Otherwise, we repeat ii. until we reach such a situation. In the end of 
this process, we will find a graph H having degrees t; — 1,% — 1,...,t) — 
1,d,,...,dn. 

Conversely, assume that (t] — 1, fo -1,..., ts —1,d1,..., dy) is the sequence of 
degrees of some simple graph H, and let u; be a vertex in H with degree t; — 1. Form 
a graph G by adding a vertex to H and making it adjacent to w1,..., us. Then, the 
sequence of degrees of the vertices of G is precisely (5, t, fo,...,ty,d1,..-,dn). 

Oo 


2 After Vaclav J. Havel, Czech mathematician, and Seifollah L. Hakimi, Iranian-American mathe- 
matician, both of the twentieth century. 
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Problems: Sect. 5.1 


1. Solve Problem I at the beginning of this chapter. 

2. Prove that every graph has at least two vertices with the same degree. 

3. Given graphs G; = (V1; Ei) and Gz = (V2; E2), with |Vij| = |V2|, prove 
that G; and G2 are isomorphic if and only if Adj(G2) can be obtained from 
Adj(G 1) by means of a permutation of rows. 

4. * Prove that graph isomorphism is an equivalence relation in the family of 

graphs. 

. Compute the number of (labelled) graphs with vertices uw), ..., Un. 

6. Prove that there does not exist a graph of seven vertices, with degrees 1, 1, 2, 3, 
4, 5 and 6, in some order. 

7. Give an example of a graph of eight vertices, of degrees 1, 1, 2,3, 4, 4, 5 and 6 
in some order. 

8. Given n € N, show that there exists a simple graph of 2n vertices, whose 
degrees aren,n,n—1,n—1,...,3,3,2,2,1,1. 

9. *If G = (V; E) isa graph, its complement is the graph G = (V; E°), where 
E* denotes the complement of E in P2(V). If |V| =n, prove that: 


Nn 


(a) dg(u) + dg(u) =n — 1, foreveryu e V. 
(b) |E| + |E*| = (3). 


10. A graph is self-complementary if it is isomorphic to its complement. If G is a 
self-complementary graph with n vertices, prove that n leaves remainder 0 or 1 
upon division by 4. 

11. * Generalize Examples 5.15 and 5.16. More precisely, given a graph G = 
(V; £) and’ A BCV,0 AAC E, define the subgraphs G — B and 
G — A of G, respectively obtained by the excision of the vertices in B and of 
the edges in A. 

12. * Given a graph G and a subset A of V(G), the subgraph of G induced by A 
is the graph G|4 = (A; E’), where 


E' = {{u, v} € E; u,v € A}. 


Prove that: 


(a) For u € V(G), one has G — u = Giv(G)\(u}- 
(b) Gj4 = G — A, where A® denotes the complement of A in V(G) and 
G — A‘ is defined as in the previous problem. 


13. Let G = (V; E) be a graph. We choose two nonadjacent vertices, say u and v, 
and add edge {u, v} to E, thus obtaining a new graph H = (V; F). Prove that 
the difference between the numbers of odd degree vertices in H and in G is —2, 
0 or 2. Then, use this fact to give another proof of Corollary 5.6. 
For the coming problem, the reader may find it convenient to recall the 
discussion on congruence modulo n, in Sect. 2.3. 
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14. 


15. 


16. 


17. 


18. 


19. 


20. 
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* Given integers 1 <n < m, let G(m,n) = GU; E) be the graph such that, 
for! <i, 7 <m, 


fi, j}e Es j =i +n(modm). 


(a) Represent G(m,n) when m = 6 and n varies from | to 5. 

(b) Ifm = 2n, prove that each vertex has degree | and | E| = n. Then, represent 
the graph thus obtained. 

(c) If m ¥ 2n, prove that each vertex has degree 2 and |E| = m. 

(d) Prove that G@m,n) ~ G(m,m —n). 


* In the notations of the previous problem, represent the graph obtained from 
G(5, 1) and G(5, 2) by joining, for 1 < i <5, vertex i of G(5, 1) to vertex 2i 
of G(5, 2). The graph thus obtained is known as Petersen graph.* 

For the next problem, one says that a nonempty subset A of the set of vertices 
of a graph G is independent if any two vertices in A are nonadjacent in G. 
* A graph G = (V; E) is bipartite if one can write V = V, U V2, with Vi 
and V2 being disjoint, nonempty and independent sets of vertices. Prove that 
|E| < |Vi|-|Vo|, with equality if and only if each vertex of Vj is adjacent to 
some vertex of V2, and vice-versa. 
* In the notations of the previous problem, a bipartite graph G = (V U V2; E) 
is said to be complete if |E| = |Vi| - |V2|. If H = (Wi U Wo; F) is another 
complete bipartite graph, prove that 


G2As[|Vi| =|Wi| and |V2| = |W2l|, or vice-versa. 


Thanks to such a result, if |V;| = m and |V2| = n, we write simply K,,., to 
denote the complete bipartite graph G = (Vj U Vo; E). 

* Tn the notations of Problem 16, let V; = {1,...,m}and V2 = {m+1,...,m+ 
n}. Prove that the adjacency matrix of G has the form 


Adj(G) = ae al 


where A is an m x n matrix and A! denotes the transpose of A. 

A bipartite graph G = (V; U V2; E), with independent sets of vertices V; and 
V>, has 16 vertices of degree 5 and some (at least one) vertices of degree 8. If all 
of the vertices of degree 8 belong to V;, compute how many vertices of degree 
8 the graph G can have. 

* A graph G is r-regular if all vertices of G have degree r. 


3 After Julius P. C. Petersen, Danish mathematician of the nineteenth century. 
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21. 


22. 


23. 


24. 


29. 
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(a) If G is an r-regular graph with n vertices, prove that 2 | nr and that G has 
exactly 5 edges. 
(b) Given natural numbers n and r, such that r < n and nr is even, show how 


to construct an r-regular graph with n vertices. 


A battalion commander asked volunteers to compose 11 patrols, all of which 
formed by the same number of men. If each men entered exactly two patrols 
and each two patrols had exactly one men in common, compute the numbers of 
volunteers and of members of each patrol.* 

Let G = (V; U Va; E) be a bipartite graph, with |Vi| = m and |V2| = n. It is 
known that all vertices in V; have degree g, all vertices in V2 have degree 2 and 
|N(u) M N(v)| = 1, for all distinct vertices u,v € Vj. 


(a) Compute g and n in terms of m. 
(b) For the values of g and n computed as in (a), show that there is a bipartite 
graph satisfying the stated conditions. 


(Japan) A total of x students entered a Mathematics competition in which 2y 
problems were proposed. It is known that each student solved y problems and 
each problem was solved by the same number of students; moreover, each two 
students solved exactly three problems in common. Compute the values of all 
possible pairs (x, y) and, for each such pair, exhibit a configuration showing 
which problems were solved by each student. 

(Hungary) Prove that it is impossible to place the numbers 1, 2, 3,..., 13 around 
a circle so that, for any two neighboring numbers x and y, we have 3 < |x—y| < 
Be 

Let G = (V; E) be a graph. A subset V’ of V is a vertex cover of G if every 
edge of G is incident to least one vertex in V’; a vertex cover V’ is minimal if 
G has no vertex cover with less elements than V’. A matching in G is a subset 
E’ of E formed by pairwise disjoint edges; a matching E’ is maximal if G 
has no matching with more edges than E’. The purpose of this problem is to 
prove Kénig’s theorem®: in every bipartite graph, the number of vertices in a 
minimal vertex cover is equal to the number of edges in a maximal matching. 
To this end, let G = (V, U V2; E) be bipartite, with independent sets of vertices 
VY, and V2, and do the following items: 


(a) Let x be the relation on Vj U V2 given byu xv Su € Vj, v € Vo. Show 
that < is a partial order on Vj U V2. 
(b) With respect to <, show that: 


i. An antichain is the same as the complement of a vertex cover in G. 
ii. A collection of disjoint chains of lengths greater than 1 is the same as a 
matching in G. 


For another approach to this problem, see Problem 5, page 49. 


After Dénes Kénig, Hungarian mathematician of the twentieth century. 
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Fig. 5.2 A transitive (left) or 
cyclic (right) K3 


26. 


(c) Apply Dilworth’s theorem (cf. Problem 6, page 115) to prove K6nig’s 
theorem. 


For the coming problem, we define a directed graph, or digraph as a graph 
G such that we associate an orientation to each one of its edges. Informally, this 
can be thought of as an arrow along the edge; formally, each edge of a digraph 
is an ordered pair (instead of a set) (u, v), for some distinct u, v € V(G), such 
that if (u, v) is an edge then (v, uw) is not. Given a vertex u of a digraph G, 
we define the out degree ae (u) of u as the number of arrows departing from 
u; analogously, the in degree of u, denoted d¢ (u), is the number of arrows 
arriving at u. Hence, 


dg(u) = d&(u) + dg (u) 


is the degree of u in G, thought of as an ordinary graph (1.e., the graph obtained 
by removing the orientations of the edges of the digraph). 

If a digraph contains a K3, then such a K3 is said to be transitive or 
cyclic, provided it satisfies the situation depicted at left or right, respectively, in 
Fig. 5.2. A digraph G is complete, or a tournament if it is a complete graph 
when we remove the orientations of its edges. 

* If G is a complete digraph of n vertices, prove that: 


(a) Di<cydt@ = (@), where V is the set of vertices of G and we write d* (u) 
in place of ae (u). 
(b) The average out degree of G is aes oe 


(c) The number of transitive K3’s in G is )*, eV (7), 
(d) The number of cyclic K3’s in G is given by 


n 1(/n 1 ais. 3 
(3)+2() “abe 


(e) The number of cyclic K3’s in G is also given by 
a=) _ atw-a7y 
A ee 


ueV 


Then, conclude that this is at most 


n(n? — 1)/24, if n is odd 
n(n* — 4)/24, if n is even ” 
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27. (Japan) Fourteen people dispute a chess tournament in which each two players 
have a match and there are no ties. Compute the largest possible number of sets 
of three players such that, in the three matches involving two of them, each 
player wins one match and loses another one. 


5.2 Paths, Walks and Cycles 


In this section we study the problem of deciding whether a graph is connected, 1.e., 
has only one piece. As a byproduct, we introduce the concepts that name the section 
and prove several important results related to them. In particular, we give a necessary 
and sufficient condition for the existence of an Eulerian walk in a connected graph, 
and characterize bipartite graphs in terms of cycles. 


Definition 5.18 A walk of length k > 1 in a graph G is a sequence 
P = (uo, U1, ..-, Uk) (5.3) 


of (not necessarily distinct) vertices of G, such that uj, is adjacent to u;, for 1 < 
i <k. A walk P as above is said to be closed if up = ux. 


The graphs we wish to qualify are those for which there is a walk between any 
two vertices. Since these graphs are quite important for our subsequent discussion, 
we isolate them in the coming 


Definition 5.19 A graph is connected if there exists a walk between any two of its 
vertices. A non connected graph is said to be disconnected. 


The coming proposition furnishes a simple (sufficient, albeit obviously not 
necessary) criterion for a graph to be connected. For its statement, we need to 
introduce a concept which will also be useful in other future context: given a graph 
G = (V; E), its minimum degree is the nonnegative integer 5(G) given by 


6(G) = min{dg(u); u € V(G)}. (5.4) 


Proposition 5.20 [f G is a graph of n vertices for which 5(G) = |%], then G is 
connected. 


Proof If n = 1, there is nothing to do. If nm = 2, then 6(G) = 1 and, hence, the two 
vertices of G are adjacent. Suppose, then, that n > 2, and let u and v be distinct 
vertices of G; we shall show that there is a walk between them. If u and v are 
adjacent, there is nothing to do. Otherwise, since 
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dgtu) + dg(v) = 2|n/2| >n-—2= #(V(G) \ {u, v}), 


the pigeonhole principle guarantees the existence of a vertex w ¥ u, v, adjacent to 
both u and v. Therefore, the walk (u, w, v) joins u and v. oO 


For a fixed graph G, it is immediate to verify that the relation ~ in V(G), defined 
by letting 


u~vu<> u=v or there exists a walk in G joining u and v, (5.5) 


is of equivalence. Also, note that an equivalence class of V(G) with respect to such 
a relation is, in particular, a set of vertices of G. 

For the next definition, the reader may find it convenient to review the notion of 
subgraph induced by a set of vertices, in Problem 12, page 132. 


Definition 5.21 The connected components of a graph G are the subgraphs G|,4 
of G, induced by the equivalence classes A of V(G) with respect to the equivalence 
relation (5.5). 


If A is an equivalence class of vertices of G with respect to (5.5), then, in 
particular, there is a walk between any two distinct vertices in A, so that Gj), is 
a connected graph. On the other hand, if H is a connected subgraph of G containing 
Gia, then H = Gy,. Indeed, if u € V(H) andv € A C V(A), then the 
connectedness of H assures the existence of a walk in H (and hence in G) joining 
u and v; therefore, u € A, since A is an equivalence class of vertices of G with 
respect to the existence of walks. Thus, we have A C V(H) C A, from where it 
follows that H = G\,4. The above argument allows us to assert that the connected 
components of a graph are precisely its maximal connected subgraphs. 

Back to the analysis of the walks in a graph, we collect yet another relevant piece 
of terminology, which goes back to Euler. In this sense, if P is a walk in a graph G 
as in (5.3), we say that {u;_1, u;}, for 1 <i <k, are the edges of G traversed by P. 


Definition 5.22. An Eulerian walk in a connected graph is a closed walk which 
traverses each edge of the graph exactly once. A connected graph is Eulerian if it 
contains an Eulerian walk. 


Let G be an Eulerian graph and P be an Eulerian walk in G, as in (5.3). For an 
arbitrary vertex u of G, if k > 1 is the number of occurrences of u in P, we claim 
that dg(u) = 2k, an even number. This is obvious if all such occurrences of u are of 
the form u = u;, with | <i < k (for in this case uj;_; and u;+1 are distinct vertices 
of G, both adjacent to w); on the other hand, if wo = u, note that uz, = u too, for P is 
closed. Therefore, a necessary condition for a connected graph G to be Eulerian is 
that all vertices of G are of even degree. Theorem 5.24 below guarantees that such 
condition is also sufficient for the existence of an Eulerian walk. Before we discuss 
it, we need an important auxiliary result. 
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Lemma 5.23 If G is a connected and nontrivial graph in which every vertex has 
even degree, then G contains a closed walk which does not traverse the same edge 
twice. 


Proof Choose a vertex ug of G and, starting at uo, construct a walk in G by 
performing the following operation as many times as possible: being at a vertex 
ug—1 Of G, with k > 1, choose a vertex ux of G, adjacent to ug—; and such that the 
edge {uz—1, ux} has not been chosen yet. 

In order to show that the algorithm described above does generate a closed walk 
in G, assume that, after a certain number of (at least one) operations, we find 
ourselves in a vertex ux of G. There are two possibilities: 


(i) ux = uo: there is nothing to do, for the closed walk is (uo, u1,..., Ux). 

(ii) ux 4 Uo: suppose that the vertex ux has appeared (with smaller indices) other 
1 times along the performed operations; this used exactly 2/ + 1 of the edges 
incident to uz (two edges for each of the / previous occurrence of ux, and an 
extra edge to reach ux from uz—1). However, since ux has even degree, there 
exists at least one edge incident to ux; and which has not been used yet; hence, 
the algorithm does not stop at ux. 


Finally, since the number of edges in G is finite, the above discussion assures that 
the algorithm stops at uo and, when it does so, we have succeeded in constructing a 
closed walk in G which does not traverse the same edge twice. oO 


We are now in position to prove the result of Euler alluded to above. 


Theorem 5.24 (Euler) A connected and nontrivial graph G is Eulerian if and only 
if all of its vertices have even degree. 


Proof We have already seen, in the paragraph preceding Lemma 5.23, that the 
condition that all vertices of G have even degree is indeed necessary for the 
existence of an Eulerian walk. In order to establish the sufficiency, let us make 
induction on the number of edges of G. 

Since G is nontrivial, if all of its vertices have even degree, then it has at least 
three edges; moreover, if G has exactly three edges, then it is immediate to prove 
that it is isomorphic to K3, which is clearly Eulerian. 

Now, assume that G has n > 3 edges, and that the theorem is true for all 
connected graphs with less than n edges and such that all vertices have even degree. 
By the previous lemma, we can take in G aclosed walk P = (uo, u1,..., Uk—1, UO), 
that does not traverse the same edge twice. Let A be the set of edges in P, and 
A,..., Hj; be the nontrivial connected components of G — A (cf. Problem 11, 
page 132). Ifa vertex u of Hj occurs j > O times in P, then 


dg(u) = dy,(u) + 2], 


so that dy, (uw) is even. Since this is true for all indices | < i < / and all vertices 
u € V(H;), we conclude that H), ..., H; do satisfy the hypotheses of the theorem. 
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Since each H; has less edges than G, the induction hypothesis guarantees the 
existence of an Eulerian walk 7; for H;, for 1 < i < /. At this point, the idea is 
to glue the Eulerian walks P), ..., P; to P one by one, in a way to get an Eulerian 
walk for G. To this end, note that and P; have at least one vertex in common, 
say u. We may assume, without loss of generality, that P and 7? start and end at 
u (it suffices to change the starting points of both these closed walks, if necessary). 
Hence, following P and subsequently 1, we obtain a closed walk Pi in G, which 
encompasses all of the edges of P and P, and does not traverse one of them twice. 
It now suffices to repeat the above argument more / — | times, gluing P2 to P; to get 
a closed walk Pras and so on. The closed walk P; , obtained at the end of this process 
will be an Eulerian walk in G. oO 


The above result, proved by L. Euler in 1735, marks the birth of Graph Theory. 
Euler came to it after having been inquired on the possibility of performing a walk 
along the streets of the prussian city of Kénigsberg (nowadays the Russian city of 
Kaliningrad), traversing exactly once each of the seven bridges the city had, at that 
time, over the Pregel river. 

Taking into account that these bridges join two islands of the Pregel to the 
remaining parts of the city, Euler modelled the problem posed to him as the search 
for finding, in the graph of Fig.5.3, a walk of the type we refer today as being 
Eulerian. Although such a graph is not simple (notice the multiple edges joining 
two vertices, a situation that, by the sake of simplicity, we did not allow in our 
presentation of the theory), a moment’s reflection will easily convince the reader 
that the previous theorem remains true for non simple connected graphs. Therefore, 
this result (actually, the adaptation of the argument for the proof of the necessity 
part of it, given at the paragraph preceding Lemma 5.23) guarantees that the graph 
of Fig. 5.3 does not contain an Eulerian walk, for all of its vertices have odd degree. 

Back to the general development of the theory, we finish this section by studying 
the concepts of paths and cycles in graphs, as well as looking at some relevant 
applications of them. 


Definition 5.25 A path in a graph is a walk with distinct vertices. 


Fig. 5.3. Graph representing Cc 
the bridges of K6nigsberg 


o> 
Q 
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Even though every path is a walk, it is clear that the converse is not true in 
general, namely, that there are walks which are not paths. Nevertheless, it is not 
difficult to convince ourselves (cf. Problem 7) that, if a graph contains a walk 
between two of its vertices, then it also contains a path between them. 


Definition 5.26 A k-cycle in a graph G is a closed walk 
(U0, U1, -.-, Uk-1, Uo) 


in G, of length k > 3 and such that (uo, w1,..., Ue—1) is a path. 


The coming result uses the concept of cycle to characterize bipartite graphs (cf. 
Problem 16, page 133). 


Proposition 5.27 A graph G is bipartite if and only if every cycle in G has even 
length. 


Proof Firstly, let us assume that G is bipartite, with independent sets of vertices Vj 
and V2. If C = (uo, u1, u2,..., Uk—1, Uo) is a cycle in G, then the independence of 
the sets V; and V2 guarantees that vertices uj; and u;+1 alternate between the sets Vj 
and V2, for0 <i < k —1 (with uz; = uo). In particular, the path (uo, wi, ..., ue—1) 
alternates k — | times between Vj and V2 for, starting from uo, arrive at uz_}. 
However, since uz—; and uo are adjacent, they cannot belong to a single one of 
the sets V; or V2 and, hence, k — 1 must be odd. Therefore, the length k of C must 
be even. 

Conversely, assume that every cycle in G has even length. We use induction on 
the number of edges to show that G is bipartite. If G contains some cycle C, erase 
an edge € = {u, v} of C. Since no further cycles are created, all of the cycles of 
G — € are also of even length. Hence, by induction hypothesis, G — € is bipartite, 
with independent sets of vertices V; and V2, say. We shall show that V; and V2 are 
also independent in G. To this end, note that the path C — € in G — € joins u to v and 
has odd length; since its vertices alternate between V| and V2, it follows that u € Vj 
and v € V2, or vice-versa. Therefore, V; and V2 are also independent in G, so that 
G is bipartite too. Oo 


The coming definition, due to the nineteenth century Irish astronomer, math- 
ematician and physicist William R. Hamilton, is, in a certain sense, dual to the 
definition of Eulerian walk. 


Definition 5.28 A hamiltonian cycle in a graph is a cycle that passes through 
all of the vertices of the graph. A connected graph is hamiltonian if it contains 
a hamiltonian cycle. 


Not every connected graph is hamiltonian; for instance, any graph without cycles 
is not hamiltonian (for not-so-trivial examples, see Example 5.30 or Problem 21). 

The question of the existence or not of hamiltonian cycles in graphs is much 
more difficult than the corresponding problem for Eulerian walks. Actually, up to 
now there does not exist a set of simple necessary and sufficient conditions for a 
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graph to be hamiltonian. On the other hand, the coming result, due to the Hungarian 
mathematician of the twentieth century Gabriel Dirac, provides a simple sufficient 
condition for the existence of hamiltonian cycles in a graph. For its statement, the 
reader may find it helpful to review the definition of minimum degree of a graph, 
in (5.4). 


Theorem 5.29 (Dirac) Jf G is a graph with n > 3 vertices and such that 5(G) > 
5, then G is hamiltonian. 


Proof Proposition 5.20 gives the connectedness of G. Now, let P = (uo, ..., ug) be 
a maximal path in G (i.e., one for which there is no other of greater length). If v isa 
neighbor of uo not belonging to P, we can change P by P| = (v, uo, ..., Ux), thus 


obtaining a path in G bigger than P, which is a contradiction. Hence, P includes 
all of the neighbors of uo and, analogously, all of the neighbors of ux. In turn, this 
assures that the length of P is at least 5. 

We first claim that there exists an index 0 < i < k — 1 such that uo is adjacent to 
uj+1 and u; is adjacent to ux. In order to show this, let A = {0 <i <k—1; uj4i € 
Ncg(uo)} and B = {0 <i < k—1; uj € NG(ux)}. Since all of the vertices in 
Ng(uo) and all of the vertices in Ng(uxz) are in P, it follows that |A|,|B| > 
Since |A U B| < k, we then get 


|AN B] =|A|+|B|-|AUB| >n—-k>0, 


for P is a path in G with k + | vertices, so thatk+ 1 <n. 
Now, choose 0 < i < k — | as in the above claim, and consider the cycle 


C = (uo, Wid, Ui42,---5 Uk—1, Uk, Ui, Ui-1,...U1, UO). 


We claim that C is hamiltonian. By contradiction, suppose that some vertex of G, 
say v, is not in C. Since C has more than 5 vertices and v has at least > neighbors, 
pigeonhole’s principle (applied as in the proof of Proposition 5.20) guarantees the 
existence of a vertex w in C such that w is adjacent to v. Renaming the vertices of 
C, we can assume that 


C = (vo, U1, .--, Uk, VO), 


with vo = w. But then, P; = (v, vo, v1, ..., vg) would be a path in G bigger than 
P, which is impossible. Oo 


The lower bound on 6(G) asked by Dirac’s theorem is the best possible one, in 
the sense of the following example. 


Example 5.30 Letn > 1 be odd and let G be the graph of n vertices formed by the 
union of two copies of K(,41)/2 with a single common vertex, say u (i.e., take two 
disjoint copies of K(n+1)/2, choose one vertex out of each of them and make these 
two vertices coincide). One clearly has 6(G) = ast < as Nevertheless, G does 
not contain a hamiltonian cycle, for every cycle in G containing u must be entirely 
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contained in one of the copies of K(n+1)/2 which originated G. Indeed, if C is such 
a cycle, then C — u is a connected subgraph of G — u, which, in turn is the disjoint 
union of two copies of K(,—1)/2; therefore, C — u is contained in a single one of 
these two copies and, hence, is also contained in a single one of the two copied of 
K(n+1)/2 from which we formed G. 


Problems: Sect. 5.2 


1. For this problem, recall that if A = (4jj)mxn and B = (bjx)nxp are m x n and 
n X p matrices with entries a;; and b jx, then the product matrix of A and B (in 
this order) is AB = (cjx)mx p Such that, for 1 <i < mand 1 <k < p, one has 


n 
Ge) aah (5.6) 


This being said, do the following items: 


(a) If M is the adjacency matrix of a graph G of vertices v1, ..., Uz, use the 
above formula to prove that, fori 4 j, entry (i, 7) of M* equals the number 
of distinct paths of length 2 in G, joining vertices v; and v;. 

(b) Generalize (5.6), showing that if Aj,..., A, aren xn, with A; = (i, )nxn> 
then the product matrix of Aj, ..., Ax (in this order) is Ay...Ax = 
(bij nxn, Such that 


k 
bg au ip, Uy + ay the aj? 


eat 


with the indices /;,...,/, in the above sum varying over all possibilities 
1<h,...,&<n. 

(c) Generalize item (a) in the following way: if M is the adjacency matrix of 
a graph G of vertices v1, ..., Vv, and k > | is an integer, show that entry 
(i, j) of the matrix M* equals the number of distinct walks of length k in 
G, joining vertices v; and vj. 


2. Given a graph G and a vertex u of G, let H = (V; E), where V is the set of the 
vertices of all paths in G departing from u and E is the set formed by the edges 
of these paths. Prove that H is the connected component of G containing wu. 

3. Prove that every graph with n vertices and at least ee + 1 edges is connected. 
Give an example of a graph with n vertices and (Cy ') edges which is not 
connected. 

4. Solve Problem 2, posed at the beginning of this chapter. 


5.2 


10. 


11. 


12. 


13. 
14. 


15. 


16. 
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For the coming problem, you shall need the following fact, to be established 
in Proposition 6.26: given nonzero integers m, n and c, there exist integers x 
and y such that c = mx + ny if and only if gcd(m, n) divides c. 


. Let m and n be given naturals and d = gcd(m, n). Prove that the graph G(m, n), 


described in Problem 14, page 132, has exactly gcd(m,n) = d connected 
components. 


. Show that if a graph G is disconnected, then its complement G (cf. Problem 9, 


page 132) contains a complete bipartite graph with the same set of vertices of 
G. In particular, conclude that either G or G is connected. 


. * Tf there is a walk between two vertices of a graph, prove that there is also a 


path between them. 


. Find an Eulerian walk in the graph depicted below: 


. A semi-Eulerian walk in a connected graph G is a walk that starts and ends in 


distinct vertices of G and traverses each edge of G exactly once. Prove that G 
has a semi-Eulerian walk if and only if it has exactly two odd degree vertices. 
Prove that, in every connected graph, two paths whose lengths are the largest 
possible ones have at least one common vertex. 

A graph G has n + 2 vertices, u1, U2, ..., Un, V, W, and satisfies the following 
conditions: v and w are both adjacent to wu, ..., 4,, but not to one another; u1, 
..., Uy are pairwise adjacent. For a fixed integer 2 < k < n+ 1, compute the 
number of paths of length k in G, joining v to w. 

If all vertices of a graph have degrees at least 2, prove that the graph contains a 
cycle. 

Prove that every 2-regular graph (cf. Problem 20, page 133) is a cycle. 

(TT) During a meeting, each one of five mathematicians slept exactly twice. 
It is also known that, for each two of those five mathematicians, there was a 
moment in which both were sleeping. Prove that, at some moment, three of 
them were sleeping. 

(IMO) Let G be a connected graph with n edges. Prove that it is possible to label 
the edges of G with the integers | to n, in such a way that the labels attributed 
to the edges incident at each vertex of degree greater than | are relatively prime. 
* An edge € of a graph G is said to be a cutting edge provided G — € has more 
connected components than G. If G is a connected graph and € is a cutting edge 
of G, prove that G — € has exactly two connected components. 
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For the coming problem, given a digraph G (cf. Problem 26, page 135) and 
two of its vertices, say u and v, we define an (oriented) walk in G, from u to 
v, to be a sequence (uo, ..., ux) of vertices of G such that uo = u, up = Vv 
and the edge joining u; and uj+1 is oriented from u; to uj+1, forO <i <k.In 
this case, we say that k is the length of the walk. A path is a walk with distinct 
vertices. Finally, recall that a complete digraph or tournament is a digraph that 
turns into a complete graph when we remove the orientation of its edges. 

17. In a tournament G, show that there exists a vertex from which one can reach 
any other vertex with a path of length at most 2. 

18. (TT) In Shvambrania there are n cities, each two of them connected by a road. 
These roads do not intersect; if necessary, viaducts are used for some of them to 
overpass others. An evil wizard wants to establish one way directions in each of 
the roads, so that, if someone leaves a certain city, he/she can no longer return. 
Prove that: 


(a) The wizard can actually establish such rules. 

(b) Whatever the way the wizard chooses to set his rules, there will always be a 
city from which one can reach any other, and a city from which one cannot 
leave. 

(c) The wizard can execute his intent in exactly n! distinct ways. 


19. (Brazil) Consider ann x n chessboard (n > 1 integer), and choose one of its n2 
unit squares, say co. A path® in the chessboard is a sequence (co, C1,..-, Cm) 
of distinct unit squares of the chessboard such that, for eachO < j < m, 
squares c; and c;+1 share a common edge. Such a path is said to be optimal if 
it contains all of the n? unit squares of the chessboard. Moreover, we say that a 
path as above contains an U if, for some 0 < j < m — 3, the unit squares c; 
and cj+3 share a common edge. Show that every optimal path contains an U. 

20. (Tchecoslovaquia) Prove that, in every connected graph, there exists a vertex 
such that the average degree of its neighbors is greater than or equal to the 
average degree of all vertices of the graph. 

21. The purpose of this problem is to prove that Petersen’s graph (cf. Problem 15, 
page 133) is not hamiltonian. To this end assume, by contradiction, that such 
a graph possesses a hamiltonian cycle C. In the notations of Fig. 5.4, do the 
following items: 


(a) Prove that C contains either two or four of the edges {a, a’}, {b, b’}, {c, c’}, 
{d, d'} and {e, e’}. 

(b) If four of the edges in (a) belong to C suppose, without loss of generality, 
that {b, b’} is the one not belonging to C. Conclude that {a’, b’}, {b’, c’}, 
{b, e} and {b, d} are inC and use this to reach a contradiction. 

(c) If two of the edges of (a) are in C assume, without loss of generality, that 
{a, a’} is one of them. Conclude that exactly one of the edges {a,c} or 


Note that, in principle, this definition of path does not coincide with the one given in the text. 
Nevertheless, we have chosen to translate it according to the original. 
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Fig. 5.4 Petersen’s graph a! 


{a, d} is in the cycle, say {a, c}. If {a,c} is in C, conclude that the other 
edge of (a) which is in C is {d, d’}, and reach a contradiction. 


22. Let k and n be integers such that 2 < k < n. If f(n,k) denotes the largest 
possible number of K;’s in K,, such that any two of these K;’s have at most 
k — 2 vertices in common, prove that 


1 n ne n 
aly) < fe = £4). 


23. (Vietnam) Find the least positive integer k for which there exists a graph of 25 
vertices such that each one of them is adjacent to exactly k others, and each two 
nonadjacent vertices are both adjacent to a third vertex. 

24. (Austrian-Polish) A rectangular building is formed by two rows of fifteen 
square rooms (situated as the unit squares of two adjacent rows of an ordinary 
chessboard). Each room has one, two or three doors, that lead respectively to 
one, two or three adjacent rooms. It is known that the doors are distributed in 
such a way that it is always possible to go from one room to another without 
leaving the building. Compute how many are the distinct ways of distributing 
the doors along the building. 


5.3 Trees 


In this section we study connected graphs which do not have cycles, also said to be 
acyclic. The relevant definition is the one below. 


Definition 5.31 A tree is a connected and acyclic graph. 


If T is a tree, a leaf of T is a vertex of degree 1, whereas a node of T is a vertex 
of degree greater than 1. It is an important fact that every tree has leaves, as we now 
establish. 


Lemma 5.32 Every tree with more than one vertex has at least two leaves. 
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Proof Let T be a tree with at least two vertices and consider, in T, a path C with 
the largest possible length. If C = (u1,u2,...,Un), we claim that vu; and u, are 
leaves. Indeed, if it was dr(u;) > 2, then uv; would be adjacent to uz and to some 
other vertex, say uo. Since trees do not have cycles, we must have up 4 u3,..., Un. 
Therefore, (uo, ¥1, U2..., Un) would be a path in T with length greater than that of 
C, which is an absurd. Hence, dz (u;) = | and, analogously, dr (uy) = 1. oO 


The previous lemma allows us to prove the coming proposition, which gives a 
necessary condition for a connected graph to be a tree. Soon hereafter we will show 
that such a condition will be also sufficient. 


Proposition 5.33 If T = (V; E) is a tree, then|E| = |V|—1. 


Proof We make induction on |V|. Firstly, it is clear that every tree with two vertices 
has exactly one edge. Assume, by induction hypothesis, that every tree withn > 1 
vertices has exactly n — 1 edges, and consider a tree T with n + | vertices. By the 
lemma, 7 has a leaf u; hence, T — u is a tree with n vertices, and the induction 
hypothesis assures that T — u has n — | edges. However, since T has exactly one 
edge more than T — u, it follows that T has exactly n edges. Oo 


Corollary 5.34 If G = (V; E) is a connected graph, then G contains at least one 
cycle if and only if |E| => |V\. 


Proof Suppose first that G contains at least one cycle. We shall show that | Z| > |V| 
by induction on |E|. If € is the edge of a cycle in G, then G — € is also connected 
and has |V| vertices and || — 1 edges. Then, there are two possibilities: 


¢ G-—€ contains a cycle: it follows from the induction hypothesis that |E| — 1 > 
IV]; 

* G-—e is acyclic: since it is still connected, G —€ is a tree. Therefore, the previous 
proposition guarantees that |V| = (|/E| — 1) +1=|E|. 


In any case, we concluded that |E| > |V]|. 
Conversely, if G is acyclic, then G is a tree, so that (again from the previous 
result) |E| = |V| —1 <|VJ. oO 


Corollary 5.35 Let T = (V; E) be a connected graph. If |E| = |V| — 1, then T is 
a tree. 


Proof If T has at least one cycle, it would follow from the previous corollary that 
|E| => |V|, which is a contradiction. Thus, T is connected and acyclic, hence, a 
tree. oO 


For what comes next, given a connected graph G, associate to each edge € of G 
a weight, i.e., a positive real number w(e€) (to motivate this concept, think of the 
vertices of the graph as several cities of some region and of the weights as measures 
of travel expenses between adjacent cities). This way, G becomes a weighted graph 
and the weight w(G) of G is defined by setting 
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w(G) = >) we), 


€cE 


with E = E(G). 

Given a connected graph G, a spanning tree of G is a spanning subgraph of G 
(cf. Definition 5.14) which is a tree. An interesting problem in the elementary theory 
of trees is the following one: given a weighted graph, find a spanning tree of least 
possible weight, said to be a minimal spanning tree. 

We deal with this problem in the coming proposition, due to the American 
mathematician of the twentieth century Joseph Kruskal. Well understood, since the 
set of spanning trees of a graph is finite (and nonempty, as we shall see), there always 
exist minimal spanning trees in weighted graphs. The point of Kruskal’s theorem is 
that the proof of it actually furnishes an algorithm for finding a minimal spanning 
tree. Such an algorithm is known in Computer Science as Kruskal’s algorithm or 
greedy algorithm. 


Proposition 5.36 (Kruskal) Jf G is a weighted graph, then G has minimal span- 
ning trees. 


Proof Form a subgraph T of G according to the following algorithm: 


(i) Choose an edge of G with least possible weight and put it in T. 

(ii) After having chosen a certain number of edges of G for T, take, among the 
edges not chosen yet, one of minimum weight and whose inclusion in T does 
not generate cycles. 

(iii) Repeat step (ii) as long as possible. 


Firstly, note that the steps above give, as result, a spanning tree T of G. Indeed, 
T is certainly acyclic; it is also connected (and thus a tree), for otherwise we could, 
due to the connectedness of G, execute the algorithm one further time. Moreover, 
if T was not spanning, there would exist in G a vertex u not belonging to T but 
adjacent to some vertex v of T. Without loss of generality, we may assume that u 
and v are such that w({u, v}) is minimal, among all possible choices of u and v as 
above. Hence, the algorithm would continue with the inclusion of the edge {u, v} in 
T, which is an absurd. 

Now, we let S be another spanning tree of G, and will show that w(T) < w(S). 

Since S and T have the same number of edges (cf. Problem 3), there exists an 
edge € of T such that € does not belong to S. Again by Problem 3, the inclusion of € 
in S generates a cycle C; but since T is a tree, there exists some edge ¢’ in C, distinct 
from € and not belonging to T. In particular, ¢€’ is in S. 

We claim that w(e) < w(e’). Indeed, if w(e) > w(e’), then, upon executing 
Kruskal’s algorithm, at the step we chose € we would have chosen e’, for it would 
have a smaller weight. However, since ¢’ is not in T, this gives a contradiction. 

Since w(€) < w(e’), changing ¢’ by € in S, we obtain a new generating tree S), 
such that w(S;) < w(S) and S; and T have one more edge in common than S' and 
T do. If T has k edges not belonging to S, then, repeating the above reasoning more 
k — | times, we obtain a sequence S}, S2,..., S% of spanning trees such that S$; = T 
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and 
w(Sk) S++: < w(S1) < w(S). 


Thus, w(T) < w(S). Oo 


We finish this section by counting the number of labelled trees (cf. Remark 5.13) 
with a given number of vertices and a given set of labels. This result is due to 
the nineteenth century English mathematician Arthur Cayley and, for the sake of 
simplicity, we omit the set of labels from its statement. 


Theorem 5.37 (Cayley) There are exactly n"~* labelled trees of n vertices. 


Proof The given formula is trivially true for nm = | andn = 2. Assume n > 2 and, 
without loss of generality, that the vertices of the trees will be labelled 1, 2, ..., 7. 
Let a, be the total number of such trees and, for 1 < i <n, let A; stands for the set 
of them such that the vertex with label i is a leaf. Since it is impossible that all of 
the n vertices are leaves, the inclusion-exclusion principle (cf. Theorem 2.1) gives 


an = |A,U...UA),| 


n—-1 


= > (—1)*1|4;,9...N Agl. (5.7) 


k=1 1<i, <-++<ip<n 


Now, given integers |< k <nand1 <i; <--- < ix <n, note that Aj,N... 
Ai, is the set of trees in which vertices i, i2, ..., iz are leaves. If T stands for one 
such tree, then 


Tijin...ig =T-— {i1, i2, sey ix} 


is a tree with n —k vertices, with labels in J, \{i1, i2, ..., ix}. The key to count | Ai, 
...(Aj,| 18 to look at how T can be reconstructed from 7;,;,...;,. To understand this, 
note that for a fixed 1 < j < k vertex i; must be joined to one of the n — k vertices 
of Tj, i5...i,3 since the involved choices are independent, we conclude that there are 
exactly (n — kk possible ways of choosing the vertices of 7;,;,..i, to which vertices 
ij, i2,..., ix will link themselves. Since there are a,_; possibilities for 7j,,..i,, We 
therefore conclude that 


|Ai, 9... Ag | = 2 — k)Fan_ x. (5.8) 


iy 
In the above formula, note that the right hand side depends solely on k, but not 

on ij, i2, ..., iz. Since there are exactly (7) ways of choosing integers | < ij < 
» < ig <n, Eqs. (5.7) and (5.8) give us 


5.3. Trees 149 


n—-1 
dn = Deve(?) (0 — k)dn—k. 
k=1 


Now assume, by induction hypothesis, that a, = /'~? for 1 < 1 < n. Then, the 
recurrence relation above gives 


n—-1 


ay, = )(-1)e! (;) @—0 2. 


k=1 


We finish the proof by applying the formula of Problem 3, page 40, with m = n — 2 
to get a, =n". Oo 


Problems: Sect. 5.3 


1. * Prove that the following claims on a graph G are equivalent: 


(a) G isa tree. 

(b) Given arbitrarily two vertices of G, there is a single path in G connecting 
them. 

(c) G is connected and every edge of G is a cutting edge (cf. Problem 16, 
page 143). 


Then, conclude that, in a tree with n vertices, there are exactly ) distinct paths. 
2. Let T be a tree with a node of degree k. Prove that T has at least k leaves. 
3. * Let G be a connected graph. 


(a) Prove that any two spanning trees of G have the same number of edges. 

(b) If T is a spanning tree of G and € is an edge of G that is not in T, prove that 
T U {e} contains cycles. 

(c) If G has a single spanning tree, prove that G is a tree. 

(d) If T is a spanning tree of G and G has k edges not belonging to T, prove that 
G contains at least k distinct cycles. 


4. Prove that the following algorithm also constructs minimal spanning trees of 
weighted graphs: 


i. Choose a vertex and, among all edges incident to it, the one with least 
possible weight. 

ii. After having chosen a certain number of vertices and edges, choose the edge 
of least weight, among all edges not yet chosen and that join an already 
chosen vertex to a not chosen yet one. 

iii. Repeat the previous item as many times as possible. 
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The above algorithm is due to the Dutch mathematician of the twentieth century 
Edsger Dijkstra, being known as Dijkstra’s algorithm. 


ab |ac jad |ae jaf | ag | be |bg |cd |de |ef | fg 
6 |3 |2 |4 |3 |7 |6 |6 |2 |3 |1 18 


5. Let G be a graph with set of vertices {a, b,c, d,e, f, g} and with edge weights 
given as in the following table: Find all possible spanning trees of minimal 
weight. 

For the coming problem, we shall use the fact, established in Problem 1, that 
a tree with n vertices contains exactly (5) distinct paths. 

6. Let T be a tree with n vertices and satisfying the following condition: it is 
possible to associate to each one of the n — | edges of T a natural weight, so 
that the (5) distinct paths in T have, in some order, weights equal to 1, 2,..., (s): 
The purpose of this problem is to show that, in this case, either n or n — 2 must 
be a perfect square. To this end, do the following items: 


(a) Show that the number of paths in T having odd weight is equal to: 


5(5). if (5) is even 


1 ((") +1), if () isodd 


(b) Show that it is possible to paint the vertices of T either blue or red, in such 
a way that two adjacent vertices have distinct colors if and only if the edge 
joining them has odd weight. 

(c) If the coloring of item (b) generates x blue vertices and y red ones, show that 
the number of paths in T with odd weight is equal to xy. 

(d) If (5) is even, show that n = (x — y)?; if (5) is odd, conclude that n — 2 = 


(x — y)*. 


5.4 Independent Sets and Cliques 


This last section on graphs is devoted to a somewhat more detailed study of complete 
subgraphs of a given graph, and is partially based on the classnotes [16]. I kindly 
acknowledge professor Samuel B. Feitosa for having allowed me to use them. 

Given a graph G, recall that a nonempty subset A of V(G) is said to be 
independent if any two vertices in A are nonadjacent. If G has n vertices, we can 
partition V(G) inn sets (each set consisting of a single vertex) such that each one of 
them is independent. Then, there exists a least natural number k such that V(G) can 
be partitioned into k independent sets. We name such a natural number according to 
the following 
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Definition 5.38 Given a graph G, its chromatic number, denoted 4 (G), is the 
least natural number k such that the set of vertices of G can be partitioned into k 
independent sets. 


The reason for the word chromatic in the above definition is due to the fact that, 
if V(G) = k and V(G) = A, U...U Ax, with Aj, ..., Ax independent sets, then, 
painting the vertices in A; with a color c; (where cj, ..., cx are distinct colors), 
we obtain a coloring of the vertices of G in which any two adjacent vertices have 
distinct colors; conversely, if k is the least possible number of distinct colors for 
which there exists a coloring of the vertices of G with k distinct colors and with 
no adjacent vertices of the same color, then the sets formed by the vertices of each 
color are k independent sets that partition V(G). 


Examples 5.39 


(a) If G is a bipartite graph (cf. Problem 16, page 133) with at least one edge, it is 
immediate to see that ¥(G) = 2. 

(b) If G is a 2-cycle with n vertices, then it is clear that V(G) = 2 if n is even 
(for we can color the vertices by alternating two colors along the cycle), but 
X(G) = 3 if n is odd (for in this case we need a third color for the last vertex 
of the cycle). 


Generalizing item (b) of the above example, the proof of the coming proposition 
describes a rather simple algorithm which guarantees that if no vertex of a graph has 
degree greater than k, then its chromatic number is at most k + 1. In all that follows, 
given a graph G we let 


A(G) = max{dg(u); u € V(G)} 


and say that A(G) is the maximum degree in G. 
Proposition 5.40 In every graph G, we have X(G) < A(G) +1. 


Proof Let A(G) = n and fix n + 1 distinct colors. Start by painting an arbitrary 
vertex of G with one of the colors. Then, assume that, at some moment, k < n 
vertices of G have been painted with the n + 1 given colors and in such a way that 
adjacent vertices have distinct colors. Choose a vertex u that has not been painted 
yet. Since dg(u) < A(G) = n, at most n of the given colors were used to paint 
vertices adjacent to u; hence, we are left with at least one of the n + | colors to paint 
u, so that it has a color distinct from those of all of its neighbors. oO 


The following definition furnishes a lower bound for the chromatic number of a 
graph. 
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Definition 5.41 An n-clique in a graph G is a subgraph of G which is isomorphic 
to Ky. The clique number of G, denoted w(G), is the greatest integer n > 0 such 
that G contains an n-clique.’ 


Since one needs n distinct colors to color the vertices of K;, in such a way that 
no two adjacent vertices have the same color, we have 


X(G) = w(G), 


for every graph G. On the other hand, we can estimate w(G) upon the intuition that, 
if a graph has lots of edges (when compared to its number of vertices), then w(G) 
must be large. Let us take a look at a relevant 


Example 5.42 Let G be a connected graph with n > 1 vertices. If G has more than 
| “| edges, then w(G) > 3. 


Proof By contraposition, assume that G does not contain a K3. 
Let u be a vertex of G with maximum degree, say k. We can assume that k > 1, 
for, otherwise, Euler’s Theorem 5.5 would give us at most 5 edges in G, and this 


would contradict the fact that + < i=] +1 forn > 1. 

Since G has no K3, the k vertices of G adjacent to u form an independent set. 
Therefore, every edge of G which is not incident to u is incident to some of the 
n — k — 1 vertices of G not belonging to Ng(u) U {u}. However, since each one of 
these n — k — | vertices has degree less than or equal to k, we conclude that G has 
at most 


k+(n—1—k)k = —k? +nk 


edges. 
In order to finish the proof, it suffices to note that 
2 
ee 
A 
so that we must have —k* + nk < (21. oO 


The coming problem brings an interesting application of the previous one. 


Example 5.43 (China) Given distinct real numbers x1, x2, ..., X,, prove that there 
; 2 . 
exists at most L[J ordered pairs (x;, x;) such that 1 < |x; — x;| < 2. 


7We call the reader’s attention for not confusing notations w(G) for the weight of a weighted graph 
and w(G) for the clique number of a graph; the first uses the Latin letter w, whereas the second 
uses the Greek letter w ( omega). 
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Proof Construct a graph G having the x;’s as vertices, with x; adjacent to x; if and 
only if 1 < |x; — x;| < 2. We claim that G does not contain K3’s. Indeed, if x;, x; 
and x, were the vertices of a K3 in G, with x; < x; < Xx, say, then 


Xk — Xi = (XK — Xj) + Oj — xj) > 14+1=2, 


which is a contradiction. Therefore, by the previous example we conclude that G 
2. 
has at most | "7 | edges. Oo 


In order to generalize the reasoning of Example 5.42 from K3’s to K7’s, let us 
firstly construct a generic example of a graph with n vertices and without K7’s. To 
this end, we start by assuming that / < n, for otherwise there is nothing to do; then, 
we use the division algorithm (cf. Proposition 6.7) to write 


n=(l—Dq+r, 


with 0 < r </—1. Then, partition the set of n vertices into /— 1 subsets Aj, Ag,..., 
Aj—1, such that 


[Ay] =...= |A-| = q +1 and |A;41| =... = |Ay-1| = @; 


that it is possible to get such a partition follows from the equality r(g+1)+(-—r-—- 
1I)g = @—Dq+r =n. Continuing, impose that each A; be an independent set, and 
also that each vertex of A; is adjacent to each vertex of A;, forall 1 <i, j </—1, 
with i 4 j. The graph G thus obtained is called a Turan graph, after the Hungarian 
mathematician of the twentieth century Paul Turan, to whom we owe part of the 
material of this section. 

The Turan graph just constructed does not contain any K7, since a K; contains 
l-cycles, whereas G does not contain any such cycle. On the other hand, by counting 
how many edges of K,, are not present in G, we conclude that it has exactly 


TaD := (5) -/(% : ') iio v(4) 


edges. Substituting g = 7; and doing some elementary algebra, we get 


a oe ae oe 
(en — ny (5.9) 


even though w(G) < /. 
The coming result (cf. [39]) assures that, if G is a graph with n vertices and such 
that w(G) < /, then G must have at most T(n; 1) edges. 


Theorem 5.44 (Turan) [fa graph G of n vertices has more than T(n; 1) edges, 
then w(G) > I. 
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Proof In the notations of the preceding discussion, let us make a proof by induction 
on q, assuming that r and / are fixed. If g = 0, then n = r and it follows from (5.9) 
that 


n2(1 — 2) —n(l—1—n) 


PR 2-1 
_ Wdl—1)—n-1) 
df —1) 


Wan | n 
2 2) 


Since a graph with n vertices cannot have more that (5) edges, the statement of the 
theorem is vacuously true.® 

Now, consider a graph G with n vertices, without K;’s and with the largest 
possible number of edges. Clearly, G contains a K)_1, for otherwise we could add 
at least one more edge to G and it would still contain no K;. Let H be a K)_; inG, 
and A be the set of the remaining n — / + 1 vertices of G (i.e., those not belonging 


to H). There are three kinds of edges in G: 


¢ The edges of H: overall, Cs) edges. 

e The edges joining vertices in A to vertices in H: since G contains no Kj’s, each 
vertex in A is adjacent to at most / — 2 vertices in H. This gives us at most 
(n —1+ 1)( — 2) edges of this kind. 

° The edges of the induced subgraph Gy, (cf. Problem 12, page 132): since Gj, 
does not contain K;’s too, andn —1+ 1 = (g — 1)@ — 1) +1, we can apply the 
induction hypothesis to Gj,, so that G;4 has at most T(n — 1 + 1; 1) edges. 


Therefore, the graph G has at most 


Pin-14 D+ n-14pd-2+ ("5") 


edges. 
Finally, substituting the formula for T(n — 17+ 1; 1) (obtained from (5.9)) and 
performing some elementary algebra once more, we get 


Pn-14 D+ @-1+ 0-9 +(' ; 


1 
) =T(n;)), 


so that G has at most T (n; /) edges. oO 


8In terms of Logic, an implication is true whenever it premise is false. See Chapter 1 of [34], for 
instance. 
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A nice way of grasping the details of the proof of Turan’s theorem is to rework it 
in the particular case (/ — 1) | n. In this respect, we refer the reader to Problems 2 
and 5. 

We finish this section by examining yet another invariant of a graph G, one which 
is closely related to w(G). 


Definition 5.45 Given a graph G, its independence number, denoted a(G), is the 
number of elements of a maximal independent set of vertices of G. 


For the coming proposition, the reader may find it useful to recall the concept of 
complementary graph (cf. Problem 9, page 132). 


Proposition 5.46 Let a graph G and a nonempty subset A of the set of vertices of 
G be given. The subgraph of G induced by A is a clique in G if and only if A is an 
independent set in G. In particular, 


w(G) =a(G) and w(G) = a(G). 


Proof The first part follows immediately from the definitions. For the second part, 
it suffices to use the fact that G = G. oO 


Example 5.47 Let G be a graph with nine vertices. Suppose that, given any five of 
the vertices of G, there exist at least two edges with both endpoints among them. 
Compute the least possible number of edges G can have. 


Proof We consider two cases separately: 


(a) w(G) > 4: fix a K4 in G and let A be the set formed by the vertices of this K4. 
Then, A is independent in G and, if u is one of the five remaining vertices, the 
stated condition assures that u is adjacent to at least two of the vertices in A. 
Thus, we conclude that G has at least 5 - 2 = 10 edges. 

(b) w(G) < 3: by Turan’s theorem, G has at most T(9; 4) = 27 edges and, hence 
G has at least () — 27 = 9 edges. 


For an example of such a G with 9 edges, take the union of three pairwise disjoint 
K3’s. Given five vertices in G, there are two possibilities: either three of these five 
vertices form one of the K3’s of G, or we choose the five vertices by taking two 
vertices of two of the K3’s of G, together with one vertex of the third K3. In any 
case, there are at least two edges of G having both of its endpoints among the five 
vertices. Oo 


Problems: Sect. 5.4 


1. If G is a graph with at least six vertices, prove that w(G) > 3 or w(G) > 3. 
2. The purpose of this problem is to provide a slightly different proof of the result 
of Example 5.42 when G is a graph of 2g vertices, g > 1, and at least g? + 1 
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1S) 


10. 


11. 


12. 


13. 
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edges. To this end, fix two adjacent vertices u and v of G and do the following 
items: 


(a) If at least 2g — 1 edges depart from u or v to the remaining 2g — 2 vertices, 
use the pigeonhole principle to guarantee that G has a K3 of the form uuw. 
(b) If at most 2g — 2 edges depart from u or v to the remaining 2g — 2 vertices, 
apply an appropriate induction hypothesis to G — {u, v} (cf. Problem 11, 
page 132) to conclude that such a graph (and, hence, also G) contains a K3. 


. Solve Problem 3 at the beginning of this chapter. 
. Let G be a graph with 10 vertices and 26 edges. Show that G must contain at 


least five K3’s. 


. Let/ > 3 and q be natural numbers, and G be a graph of (/ — 1)q vertices, with 


at least 5(l — 1)(1 — 2)q? edges.’ Argue along the same lines of Problem 2 to 
show that G contains a K7. 


. (TT) Twenty teams entered a soccer tournament. Find the minimum number of 


matches such a tournament must have so that, among any three teams, at least 
two have played against each other. 


. Ina given a set of n people, it is known that out of any group of three of them 


at least two are acquainted with each other, whereas out of any group of four 
at least two are not acquainted with each other. Compute the largest possible 
value of n. 


. (TT) We are given 21 points on a circle. Prove that there exist at least 100 arcs, 


each of which having two of the given points as endpoints and whose measures 
are less than or equal to 120°. 


. (Poland) Let n points on a circle with radius 1 be given. Show that there is at 


most a line segments joining pairs of these points and having lengths greater 


than /2. 

(USA) In a certain society, each pair of people is classified either as amicable 
or hostile. Members of amicable pairs are said to be friends and members of 
hostile pairs are said to be enemies. Suppose that the society has n people, qg 
amicable pairs and that, among any three people, at least two form an hostile 
pair. Prove that there exists at least one member of the society whose set of 


. . 4, r , 
enemies contains, at most, no more than g (1 - 4) amicable pairs. 


Let G be a graph with n vertices, such that w(G) < k, where k > 2. Prove that 
G contains at least | ¢4,| vertices with degrees less than or equal to | hn i 
A set of 1001 people is such that each subset of 11 people contains at least two 
who know each other. Show that there exist at least 101 people in the set such 
that each one of them knows at least 100 other people within the set. 

(China) Let A be a subset of § = {1,2,3,..., 10°} having 101 elements. Prove 
that there exist x1, x2, ..., x109 in S such that, for 1 < j < 100, the sets 


Aj = {x + xj; x € A} are pairwise disjoint. 


°Note that this is precisely Turan’s number T(n;/) whenn = (/ — L)q. 


Chapter 6 m®) 
Divisibility spooks 


This chapter is devoted to the elementary properties concerning the relation of 
divisibility in the set of integers, with a particular emphasis on the division 
algorithm, on the notion of greatest common divisor and the fundamental role 
played by prime numbers. In spite of the elementary character of the arguments 
we shall use, we will meet several interesting problems and results along the way, 
like Bézout’s theorem on the characterization of the greatest common divisor of two 
integers and Euclid’s theorem on the infinitude of primes. 


6.1 The Division Algorithm 


We start by isolating our central object of study, namely, the relation of divisibility 
between two integers. 


Definition 6.1 Given a, b € Z, with b $ 0, we say that b divides a, and write b | a, 
if there exists c € Z such that a = bc. In case b doesn’t divide a, we write b { a. 


Let b be a nonzero integer. If b divides a, we also say that b is a divisor of a, 
that a is divisible by b or that a is a multiple of b. If b | a and b > 0, then bisa 
positive divisor of a. Note that every nonzero integer is a divisor of itself and of 0. 
Also, +1 divide every integer. 

The coming example starts by formalizing two concepts we are certainly familiar 
with. 


Example 6.2 An integer n is even provided it is a multiple of 2; otherwise, n is said 
to be odd. According to Definition 6.1, the even integers are precisely those which 
can be written in the form 2k, for some k € Z, i.e., the integers 


..., —6, —4, —2,0,2,4,6,.... 
© Springer International Publishing AG, part of Springer Nature 2018 157 


A. Caminha Muniz Neto, An Excursion through Elementary Mathematics, Volume III, 
Problem Books in Mathematics, https://doi.org/10.1007/978-3-319-77977-5_6 


158 6 Divisibility 
The remaining integers, namely, 
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are the odd ones. Thus, every odd integer is one plus an even integer, so that can be 
generically denoted by writing 2k + 1, where k € Z. 


The next example isolates an application of a simple algebraic identity which 
will be quite useful for us. 


Example 6.3 Let a and b be two given integers and k be a natural number. 


(a) Ifa <b, then (a — b) | (ak — b*), 
(b) Ifa 4 —b and k is odd, then (a + b) | (ak + b*). 


Proof Standard basic algebra (cf. problem 2.1.18 of [8], for instance) gives 
a® = b* = (a= ba! + ab +e pape? ep!) 
and, for odd k, 


a® + B® = (a+ b)(a*—! — a7 4+. -- — abt? +. BF, 


Since both ak! +a’? +.--+.abk-? + bE! and ak! — ak 7b +.» abk-7 + 
b‘—! are integers, it suffices to recall Definition 6.1. oO 


Example 6.4 Prove that, for every k natural, 10‘ — 1 is divisible by 9. 
Proof It suffices to apply item (a) of the previous example: 10 — 1 = 10* — 1* is 
divisible by 10 — 1 = 9. Alternatively, note that 


106 -—1=99...9=9-11...1, 
—S{ —— 
k digits k digits 


which is clearly a multiple of 9, or write, with the aid of the binomial formula, 


k 


k-1 
k ‘ k\ , ; 
of -1=9+18-1= 90 (i Fi —1=9) (\, 
J ; J 
j=0 


j=0 
again a multiple of 9. oO 


The following proposition establishes some basic properties of the relation of 
divisibility. The reader should make an effort to keep them in mind for future use. 


Proposition 6.5 Leta, b, c be nonzero integers and x, y be any integers. 


(a) Ifb | aanda | b, thena = +b. 
(b) Ifc | band b | a, thenc | a. 
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(c) Ifc | aandc | b, thenc | (ax + by). 
(d) If b | a, then |b| < |al. 

(e) Ifc |b, thenc | ab. 

(f) Ifb | a, then bc | ac. 


Proof 


(a) If a’ and b’ are integers such that a = ba’ and b = ab’, then a = (ab’)a’ = 
a(a'b’), so that a’b’ = 1. Therefore, a’ = +1, and it follows that a = +b. 

(b) If a = ba’ and b = cb’, with a’,b’ € Z, then a = (cb’)a’ = c(a'b’), with 
a'b’ € Z. Thus, c | a. 

(c) Let a = ca’ and b = cb’, with a’, b’ € Z. Then ax + by = ca'x + cb'y = 
c(a’x + b’y), with a’x + b’y € Z. We conclude that c | (ax + by). 

(d) Let a = ba’, witha’ € Z. Then, a 4 0 = a’ S O, and hence |a’| > 1. This 


gives |a| = |ba’| = |b||a’| = |b. 
(e) Ifb = cb’, with b’ € Z, then ab = c(ab’), with ab’ € Z. Therefore, c | ab. 
(f) Ifa = ba’, witha’ € Z, then ac = (bc)a’, and hence be | ac. Oo 
Remarks 6.6 


i. As a particular case of item (c) of the previous proposition, if c | a and c | b, 
then c | (a +b). We shall use this remark several times in what follows. 
ii. Item (c) of the previous proposition can be easily generalized to prove that, if 
c|aj,...,@y, thenc | (ajxj +---+dyx,) for all x1,...,x, € Z. 
iii. It follows from item (d) above that every nonzero integer has only a finite 
number of divisors. 


We continue by recalling that the integer part |x| of x € R is defined by 
[x] = max{n € Z; n < x}. (6.1) 
In other words, given n € Z, 
lxJ=non<x<n+l. (6.2) 


For instance, since 1 < /2 < 2 we have |/2| = 1; on the other hand, since 
—3 < —2.3 < —2, then |—2.3] = —3. 

Our next result is usually referred to as the division algorithm! for integers, and 
is one of the cornerstones of the basic theory of divisibility. 


Proposition 6.7 Given a,b € Z, with b ¥ 0, there exist unique q,r € Z such that 
a=bq+rand0 <r < |b|. Moreover, if b > 0, then 


‘At this point, the reader may wish to take another look at the definition of algorithm, at the 
footnote of page 116. 
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=p eee ale 63) 
= || 4nd f= pala : 
Such integers q and r are respectively called the quotient and the remainder upon 
division of a by b. 


Proof Suppose first that b > 0, and let g be the greatest integer such that bg < a. 
Then bg <a < b(q+1), so that 0 < a—bq < band it suffices to define r = a—bq. 
Note also that, since 


ae +1 
q@=;5 q , 


such an integer g is precisely | ¢| : 
If b < 0, then —b > 0, so that there exist g,r € Z for which a = (—b)q’ +r, 
with 0 < r < —b. Hence, a = b(—q’) +r, withO < r < —b = |b|. Make g = —q’. 
Now, suppose that a = bg+r = bq'+r’, witha, q',r,r’ € Zand0 <r,r’ < |DI. 
Then, 


|r’ —r| < |b] and b(g—q')=r'—r. 
If g #q’, then |g — q'| > 1, so that 
lbl < lhl -lg—q'|=Ir' -r| < [Bl, 


a contradiction. Therefore, g = q’ and hence r = r’. o 


The proof of the following corollary illustrates a typical application of the 
division algorithm. For its statement, recall that a nonnegative integer m is a perfect 
square provided m = n?, for some nonnegative integer n. 


Corollary 6.8 Every perfect square leaves remainder: 


(a) Oor 1 upon division by 3. 
(b) O or 1 upon division by 4. 
(c) 0, 1 or 4 upon division by 8. 


Proof Let n be a natural number. 


(a) By the division algorithm, n leaves remainder 0, 1 or 2 upon division by 3, so 
that n = 3q, 3q + 1 or 3g + 2, for some g € Z. Now, let’s consider these three 
possibilities: 


* Ifn = 3q, thenn? =3-3q?. 
* Ifn =3q +1, then n? = 3(3q7 + 2q) +1. 
* Ifn = 3q +2, thenn? = 3(3qg7+4qg +1) +1. 


Since 3q7 € Z, the uniqueness part of the division algorithm assures that, in 
the first case above, n” leaves remainder 0 upon division by 3. By the same 
argument, in the other two cases n? leaves remainder 1 upon division by 3. 
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(b) By the division algorithm, the remainder upon division of n by 4 is 0, 1, 2 or 
3, so thatn = 4q,4q + 1,4q + 2 or 4q + 3, for some g € Z. Hence, we can 
certainly give to this item a proof that parallels that of (a). Nevertheless, let’s 
see a simpler one: again by invoking the division algorithm (or Example 6.2), 
we can write n = 2q or 2q + 1, for some gq € Z. 


© Ifn = 2g, then n? = 4q?. 
° Ifn = 2q +1, thenn® = 4(q? +q) +1. 


Hence, upon division by 4, in the first case n? leaves remainder 0, whereas in 
the second it leaves remainder 1. 

(c) Once more, we could give a proof of this fact as a direct application of the 
division algorithm, by writing n = 8q +r for some gq € Zand somer € 
{0, 1, 2,..., 7}. However, in order to avoid having to consider eight different 
cases, we proceed as in the proof of item (b): 


* Ifn = 2g, then n? = 4q7. Now, there are two possibilities: 


— If q? is even, say gq? = 2k for some k € N, then n* = 8k. 
— If q? is odd, say q* = 2k + 1 for some k € N, then n? = 8k +4. 


* Ifn = 2q +1, thenn? = 4q(q + 1) + 1. However, since one of the integers 
q and q + | is even, in any case the product q(qg + 1) is even, so that we can 
write g(q + 1) = 2k for some k € N. Then, n* = 4- 2k +1=8k+1. 


Finally, as in the proof of the previous items, the expressions for n* obtained 
above guarantee that its possible remainders upon division by 8 are exactly 0, 1 
or 4. o 


The following elaboration of the division algorithm with play an important role 
in subsequent discussions. 


Corollary 6.9 Let aj, az and b be given integers, with b being nonzero. Then, b | 
(a, — az) if and only if a, and az leave equal remainders upon division by b. 


Proof Suppose first that aj = bq, +r and az = bq2 +r, with qi, g2, r € Z. Then, 
ay — a2 = b(qi — q2), 1e., b | (ay — a2). 

Conversely, assume that b | (a; — az) and let ay = bq, +r and a2 = bq2 +12, 
with g1, 92,711,712 € Zand 0 <r}, r2 < |b|. Then, 


a, — a2 = (Qi — q2)b + (r1 — 12) 


and, since b | (a; — az) and b | (qi — qz2)b, item (c) of Proposition 6.5 guarantees 
that b divides r} — r2 = (a, — a2) — (qi — q2z)b. On the other hand, inequalities 
0 <7r1,17 < |b| give |r; — r2| < |b|, so that the only possibility for b dividing 
r, — r2 is having |r; — r2| = 0 or, which is the same, r) = r2. oO 


As an application of the previous result, the coming example computes the 
possible values of the last digit of a perfect square. 
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Example 6.10 Every perfect square ends in 0, 1, 4, 5, 6 or 9. 


Proof First of all, notice that the remainder of a natural number upon division by 10 
coincides with its last digit (i.e., the rightmost one, in the decimal representation of 
the number). Indeed, let m = 10m’ + ag, with m’ € Z and 0 < ag < 9; since 10m’ 
ends in 0, it follows that the decimal representation of the sum 10m’ + ao (which 
equals m) has ag as rightmost digit. 

Now, let n € N and g,r € Zbe such that n = 10g +r, with O < r < 9. Then, 


n> = (10g +r)* = 100g? + 20gr + r* 
= 10(10k? + 2kr) +r’, 


so that 10 | (n? — r?), and Corollary 6.9 guarantees that n? and r? leave equal 
remainders upon division by 10. Hence, our initial remark assures that the last digit 
of n? equals that of r?. 

In order to finish the proof, it suffices to check which are the possible values of 
the last digit of r2 when r varies from 0 to 9: 0? = 0; 1? and 9? end in 1; 2? and 
82 end in 4; 3? and 7? end in 9; 4? and 62 end in 6; 5? ends in 5. Thus, the possible 
values of the last digit of n? are 0, 1, 4,5, 6 or 9. oO 


Example 6.11 Let n > 1 be an integer. Show that any sequence of 1 consecutive 
integers contains exactly one multiple of n. 


Proof Leta+ 1,a+2,...,a +n be a sequence of n consecutive integers, with 
a=nqt+r,0<r <n. Then,0 <n-—r <n, so that a + (n — 1) is one of the 
numbers of our sequence. Now, note that 


at+(n—r)=(nqg+r)+(n—-r)=nq@+), 


which is a multiple of n. 

To what is left to do, we shall apply again the result of Corollary 6.9: the numbers 
a+1,a+2,...,a-+n leave pairwise disjoint remainders upon division by n, since 
the modulus of the difference between any two of them is at least 1 and at most 
n — 1. Therefore, our sequence contains at most one multiple of 7. oO 


Problems: Sect. 6.1 


1. Prove the divisibility criteria by 3 and by 9: the remainder of the division of 
a natural number by 3 (resp. by 9) equals the remainder of the division by 3 
(resp. by 9) of the sum of the digits of its decimal representation. In particular, 
a natural number is a multiple of 3 (resp. of 9) if and only if the sum of its digits 
is So. 

2. Find the remainder of the division of 10* by 11. 


6.1 


10. 


11. 


12. 
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. Prove the criterion of divisibility by 11: if the natural number n has decimal 


representation n = (axag—1 ...@1d0)10, then its remainder upon division by 11 
equals that of the alternating sum 


ag — a1 tag —...+ (—1)* lay_1 + (—D)*ay 


upon division by 11. In particular, 11 | ” if and only if 11 divides the alternating 
sum of the digits of n. 


. (IMO) Find all natural numbers n of three digits, such that 11 | n and 7 is 


equal to the sum of the squares of the digits of n. 


. (IMO) Find all n € N such that the product of the digits of the decimal 


representation of n is equal to an* — 10n — 22. 


. * Given a natural number 1, prove that the product of any 1 consecutive integers 


is always divisible by n!. 


. (Hungary) For n € N, prove that 8” — 3” — 6” + 1” is a multiple of 10. 
. (Hungary) Prove that 5 divides 19 + 2°? + 399 + 499 4 599. 


For the next problem, given m € Z we shall let Zm denote the set 


nZ = {nq; q € Z} = {0, +n, +2n, +3n,...}. 


. Let S be a set of integers containing 0 and satisfying the following condition: 


x, yEeS,xAyoaux-yes. 


If S # {0}, prove that: 


(a) SANFYD. 
(b) Ifx, ye S,thenx+yeS. 
(c) Ifn = min(S NN), then S = nZ. 
* Compute the remainder of the division of 2° + 1 by 23% + 1. More generally, 
given integers a, m and n, all greater than | and such that m > n, compute the 
remainder of the division of a2” + 1 by a2” + 1. 

The coming problem generalizes the binary representation of natural num- 
bers, established in Example 4.12 of [8]. 
* Let m,a e€ N, with a > 1. Show that there exist unique integers 
k,mo,m,...,mx, wih k => 0,0 < mj < a—I1for0 < j < k,mz > 0 
and 


k k=1 
m=ma +mgz_ ja +++++mja+tmo. 


The above expression for m is known as its representation in base a. 
* For x € R, prove the following items: 

(a) x -1 < |x] <x. 

(b) [x +1] = [x] +1. 
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13. 


14. 
15. 


16. 


17. 


18. 


19. 
20. 


21. 
22. 
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0, ifxeZ 
(c) al+ioni=| Seg 
* Let x, y € Randm,n € Z, withn > 0. Prove the following items: 
(a) x<y=> |x] < Ly]. 
(b) [nx] >= n(x]. 
(c) mc | <|[#)+1. 
@) | et] = [4]. 
(e) [el + Ll < ley) = la el ed 
(f) ly+y] +14) + Ly] <= 2x] 4+ Ly]. 
For x € R, prove that its closest integer is given by ls + 3|. 
(ORM) Find all a, b € N such that 


oe fe era [Oe | pan 


(Hungary) Prove that there are no odd integers x, y and z such that 


(x Agr 4. (y +z)" = (x +2). 


Let x, y and z be integers such that x7 + y* = 2”. Prove that 6 | xy. 

For the coming problem, recall that a perfect cube is an integer of the form 
q°, for some q € Z. 
(Brazil—adapted) 


(a) Find all possible remainders upon the division of a perfect cube by 7. 
(b) If. x, y, z are integers such that x? + y? — z3 is a multiple of 7, prove that at 
least one of x, y, z is a multiple of 7. 


(Brazil) Find all natural numbers x, y, z, n such that n* +n? = né%. 

(Soviet Union) For n > 3 integer, show that 1! + 2!+ 3!+.---+n! is never a 
perfect square. 

(Brazil) Prove that there doesn’t exist integers x and y such that 15x*—7y? = 9. 
(France—adapted) For k, m,n € N, with k > 2, let Fyn = 27" + 1 be the m-th 
Fermat’s number? and G,, = mn) (k — 2) +n. For a fixed k, we want to find 
(if any) all m,n € N such that F,, = G,,. To this end, do the following items: 


(a) Prove that, if there exists such a pair (m,n), then there also exists a 
nonnegative integer r such that n = 2” + 1. 


? After Pierre S. de Fermat, French mathematician of the seventeenth century. As the perseverant 
reader will testify, along the rest of this book we shall have a lot more to say on the enormous 
legacy of Fermat to elementary Number Theory. 
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(b) Conclude from (a) that k — 1 = 2’t; for some t) € N, so that 22” + 1 = 
277 (2"t +t — 1). 

(c) Use induction on/ € N to show that if we had 22” +1 = 20+)" (2° 44441) 
for some t; € N, then 22" + 1 = 22)" (2" 44 +4141). 

(d) Conclude on the existence or not of such a pair (m,n). 


23. (Russia) For each positive integer m, let s(m) denote the sum of the digits of 
the decimal representation of m. Prove that there exist infinitely many natural 
numbers n for which s(3”) > s(3"+!), 
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We say that a nonzero integer d is a common divisor of given nonzero integers 
aj,...,dnifd|aj,,...,d | ay. Note that a, ..., ad, always have common divisors: 
1, for instance. Moreover, since every nonzero integer has only a finite number 
of divisors, a),..., 4, have only a finite number of common divisors. Thus, the 
following definition makes sense. 


Definition 6.12 The greatest common divisor of the nonzero integers a), a2, ..., 


an, denoted gcd(a1, a2, ..., Gn), 18 (as its name says) the greatest one of the common 
divisors of a, a2, ..., @,. Nonzero integers aj, a2, ..., dy are said to be pairwise 
coprime or relatively prime, if gcd(a), a2,...,d)) = 1. 


The coming result is due to the French mathematician Etienne Bézout.’ For its 
statement, given n € Z we let nZ denote the set of integer multiples of n, namely, 


nZ = {nx; x € Z} = {0, +n, +2n, +3n,...}. 


Theorem 6.13 (Bézout) Let a1, a2, ..., a, be nonzero integers. If 
n 
s= |S aixis x €Z,V1 sisal}, 
i=l 


then S = dZ, with d = gcd(a,, a2, ..., dy). In particular, there exist uj, uo, ..., 
uy, € Z such that 


gcd(a, d2,...,dn) = ayuy +aquz +--+ + ayy. (6.4) 


3French mathematician of the eighteenth century, Bézout is one of the precursors of the area of 
Mathematics known today as Algebraic Geometry. In spite of the coming result, his most famous 
theorem is probably the one which states that, if f and g are real polynomials in two indeterminates 
X, Y, with no common factors and with degrees respectively equal to m and n, then the number of 
points of intersection of the curves f(x, y) = 0 and g(x, y) = 0 in the cartesian plane is at most 
mn—for example, two distinct ellipses have at most 2-2 = 4 points in common. For an elementary 
exposition, we recommend [18] or [31]. 
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Proof It’s immediate to see that every multiple of an element of S belongs to S, i.e., 
uEeSS>uZcs. (6.5) 


On the other hand, since d divides a,x, +a2x2+:+-+ayXpy for all x1, x2,...,Xn € Z, 
we certainly have S Cc dZ. 

In order to establish the opposite inclusion, note first that S does contain positive 
integers; indeed, by choosing x} = a; and x2 = --- = x, = 0, for instance, we 
conclude that 


a? = ayxy + agxg +--+ + anxy € S. 


Since S contains positive integers, there exists a least positive integer d’ in S. If 
we show that d’ = d, we will get d € S, and (6.5) (with d in place of u) will give 
dZc S. 

We first claim that d’ | a1, ..., dy. Indeed, since d’ € S, there exist uw}, u2,..., 
un € Z such that d’ = ayuy + agu2 + +++ + adyuyn. Now, let ay = d’q +1, with 
q.r € Zand 0 <r < d’. Then, 


r=a, —d'q 
= a) — (au, + aqu2 + +--+ anun)g 


= a,(1 — 44g) + a2(—u2g) +++» + Gn (—Ung), 


so thatr € S. If 0 < r < d’, we would get a contradiction to the fact that d’ 
is the least positive integer in S. Therefore, r = 0 and d’ | a,. Analogously, d’ | 
a2,..+,4y- 

Finally, since d’ is a common divisor of a), a2, ..., Gy, in order to get d’ = d is 
suffices to show that d’ > d. To this end, if a, = dq), ag = dq2, ..., Gy = dn, 
with g1,92,-.--,4n € @, then 


d’ =ayuy t+anua +--+ + anuy 


= dqiu, + dqou2 +--+ +dgnun 
= d(qiuy + qou2 +-++ + dnUn), 


so that 0 < d | d’. Thus, d < d’. oO 


Remark 6.14 In the notations of Bézout’s theorem, one uses to say, thanks to (6.4), 
that gcd(a1, a2, ..., Gp) is a linear combination of a), a2, ..., dy, with integer 
coefficients. For the time being, we shall concentrate on the existence of such 
a way of representing gcd(a1, a2,...,d,). However, an actual way of getting 
Uj,U2,...,Un € Zas in (6.4) will be an easy consequence of Euclid’s algorithm 
(cf. Proposition 6.27), together with item (b) of Corollary 6.23. 
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In what follows, we collect several useful consequences of the previous theorem. 


Corollary 6.15 Let aj, a2, ..., dn be nonzero integers with gcd equal to d. If d' € 
N, then d' | aj, a2, ..., Gy if and only if d’ | d. 
Proof Take integers u1, u2,..., Uy, such that d = aju, + aqu2 +--+ + dyun. Since 
d' | aj, a2,..., 4p, item i. of Remarks 6.6 guarantees that d’ | d. The converse is 
immediate. oO 
Corollary 6.16 Let a), a2, ..., dy be nonzero given integers with gcd equal to d. 
(a) d =1 if and only if there exist integers uj, U2, ..., Un such that aju, + a2u2+ 
+++ + yun = 1. 
a, a2 an\ _ 
(b) ged (4, 9,..., 4) =1 
Proof 
(a) If d = 1, the existence of such integers u1, u2, ..., Uy, follows from Bézout’s 
theorem. Conversely, let v1, u2,..., U, be a set of integers satisfying the stated 
conditions. Since d | dj, d2,..., dy, it follows once more from item i. of 


Remarks 6.6 that d | (ajuy +--+: + Gyuy), 1.e., d | 1. Thus, d = 1. 
(b) Letting d = ayuy + aguz +--+ + dyUn, we have (4) i i (4+) Un = 1. 
Therefore, the desired result follows directly from item (a). 


Example 6.17 (China) Let a, b, c and d be nonzero integers, such that c +d #4 0 


and ad — bc = 1. Prove that ab is an irreducible fraction. 


Proof We wish to prove that gcd(a + b,c + d) = 1. To this end, and according to 
the previous corollary, it suffices to find integers u and v such that 


(a+bju+(ct+d)v=1. 


Since ad — bc = 1, one readily notices that u = d and v = —b is a possible choice. 
oO 


Remark 6.18 Let a, a2, ..., dn be nonzero integers with gcd equal to d. Letting 
uj = “. for | <i <n, item (b) of the previous corollary assures that uj, v2, ..., Un 
are relatively prime integers, for which a} = du}, a2 = du2, ..., dy = dun. 


The above remark gives a way of writing nonzero integers a), a2, ..., d, which 
will reveal itself to be useful in a number of situations. The two coming examples 
stress this point. 


Example 6.19 Every nonzero rational number can be written as an irreducible 
fraction, i.c., has a fractional representation of the form - where a and b are 
nonzero relatively prime integers. Indeed, let r be a nonzero rational number and 
r = ™ bea fractional representation of it. Then, m and n are nonzero integers and, 


letting d = gcd(m, n), we get m = da and n = db, with gcd(a, b) = 1. This gives 
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=|s 
QQ 
ES 
S 


abo he 
which gives the desired fractional representation of r. 


Example 6.20 (Russia) Let a and b be natural numbers such that ott + pot EN. 
Prove that 


gcd(a,b) < Va+b. 


Proof Let d = gcd(a, b), so that there exist integers uv and v such that a = du, 
b = dv and gcd(u, v) = 1. Then, 


Cae ae er ae 
b a dv du 
__ u(du + 1) + v(dv + 1) 


duv 


— dw? +v?)+(u+v) 
. duv 


N. 


In turn, this assures that 


+u 


2 2 
y. a uae a Anas ee ee re 


duv 


is also a natural number. Hence, d | (u+ v), so that d < u+ v. But this gives 


d* <du+dv=at+tb. 


Corollary 6.21 Given nonzero integers a, a2, ..., An, k, we have: 


(a) gcd(kay, kaz,..., kay) = |k| gcd(a1, do, ..., dy). 
(b) gcd(ay, AQ, +++, a) i gcd(ged(ay, AQ, +++, Qn—1), Qn): 


Proof 


(a) Let d = gced(q,...,a,) and d’ = gcd(kaj,..., kay). Since d divides all of 
a\,..., Gn, we conclude that |k|d divides all of ka,, ..., kan, so that |k|d is a 
positive common divisor of kaj, ..., ka,. However, since d’ is the largest of 
such common divisors, it follows that Ikld < d’. = COMVElse'y, since k divides 
d' and d' divides all of kay, ..., kan, we get that 4 Tel is a positive integer which 


divides all of a), ..., d,. This gives sf < d or, which is the same, d’ < |k\d. 
Therefore, d’ = \kld. 
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(b) Let d = gcd(aj,...,a,) and d’ = gcd(gced(aj, a2, ..., 4n—1), An). Since d | 
a\,..-,An—1, we get from Corollary 6.15 that d | gcd(aj,..., d,—1). Then, d 
divides both gcd(aj, ..., dn—1) and dy, so that (once again from Corollary 6.15) 
d divides the gcd of these two numbers, that is, d | d’. A similar reasoning shows 
that d’ | d, thus finishing the proof. 

Oo 


We specialize our discussion to the greatest common divisor of two nonzero 
integers. Given a,b € Z \ {0}, with d = gcd(a, b), Bézout’s theorem guarantees 
the existence of integer u and v such that d = au + bv. It is important to note that 
such a way of writing gcd(a, b) is not unique; actually, if t € Z, then we also have 
d = a(u—tb)+b(vu+ta). We shall have more to say about this in Proposition 6.26. 

Later, we will establish a quite important and useful algorithm for effectively 
finding the gcd of two nonzero given integers, namely, Euclid’s algorithm. We start 
by studying some properties of the gcd of two nonzero integers. 


Proposition 6.22 For any nonzero integers a, b and c, we have that: 


(a) Ifc | ab and gcd(b, c) = 1, thenc | a. 

(b) Ifa+bc £0, then gcd(a + bc, b) = gcd(a, b). 

(c) If gcd(a, c) = 1, then gcd(a, bc) = gcd(a, b). 

(d) Ifc | b and gcd(a, b) = 1, then gcd(a, c) = 1. 

(e) If gcd(b, c) = 1, then gcd(a, bc) = gcd(a, b) gcd(a, c). In particular, if b and 
c are relatively prime and divide a, then bc divides a. 


Proof 


(a) Letu, v € Zbe such that bu++cv = 1. Multiplying both sides of this equality by 
a, we get (ab)u+c(av) = a. Finally, since c | (ab), item (c) of Proposition 6.5 
shows that c | a. 

(b) Let d = gced(a + bc, b) and d’ = gcd(a,b). Since d’ | a,b, we have that 
d' | a,a + bc. Hence, by Corollary 6.15 we get d’ | d. Conversely, since 
d | (a+ bc) and d | b, we conclude that d | [(a + bc) — bc], i.e., d | a and 
d | b. Again by Corollary 6.15, we obtain d | d’ and, thus, d = d’. 

(c) Set d = gcd(a, b) and d' = gcd(a, bc). From d | b it follows that d | be. 
Therefore, d | a and d | bc, so that d | gcd(a, bc) = d’. In order to finish, 
let us show that d’ | d: since gcd(a, c) = 1, Bézout’s theorem guarantees the 
existence of u, v € Z such that au + cv = 1. This way, a(bu) + (bc)v = b, and 
such an equality, together with the fact that d’ | a and d’ | bc, proves that d’ | b. 
Then, d’ | a and d’ | b, so that d’ | gcd(a, b) = d. 

(d) Let d € Zbe such that b = cd, and choose u, v € Z for which au + bu = 1. 
Then, au + c(dv) = 1, and item (a) of Corollary 6.16 gives gcd(a, c) = 1. 

(e) Once more, set d = gcd(a, b) and write a = du and b = dv, with u,v € Z 
relatively prime. By successively applying Corollary 6.23 and the result of item 
(c), we obtain 


gcd(a, bc) = gcd(du, duc) = d gcd(u, vc) = d gcd(u, c). 
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However, d | b and gcd(b, c) = 1 imply (cf. item (d)) gcd(d, c) = 1. Hence, 
one further application of item (c) gives us 


gcd(a, c) = gced(du, c) = gcd(u, c). 
Finally, taking together both relations above, we get 
gcd(a, bc) = d- gcd(u, c) = gced(a, b) gcd(a, c). 
The rest is immediate. 


oO 


Corollary 6.23 Let a and b be nonzero integers and k, m and n be natural 
numbers. 


(a) If gcd(a, b) = 1, then ged(a™, b") = 1. 
(b) If gcd(a, b) = 1 and ab = k", then there exist u,v € Z such that a = u", 
b=v". 


Proof 
(a) Item (c) of the previous proposition, with b”—! in place of b and b in place of c, 
gives 
gcd(a, b”) = ged(a, b”~! - b) = ged(a, b"“'). 
It follows by induction on n that gcd(a, b”) = gced(a, b) = 1. Analogously, 


gcd(a”, b") = ged(a"”"~! - a,b”) = ged(a—!, b”) 


and, by induction on m, we get gcd(a”, b”) = gcd(a, b”) = 1. 

Alternatively, since gcd(a, b) = 1, Bézout’s theorem guarantees the exis- 
tence of x, y € Z such that ax + by = 1. Hence, the formula for binomial 
expansion gives 


n-1 


1 = (ax + by)" = 0 (;) (axy"“*(by)* + (by)" = aq + bry", 
k=0 


with g = a igri (by)*. Hence, item (a) of Corollary 6.16 shows 
that gcd(a, b”) = 1. Starting from this last relation and arguing in an entirely 
analogous way, we conclude that gcd(a”, b”) = 1. 


(b) Set u = gcd(a, k) and v = gcd(b, k). Since gcd(a, b) = 1, from item (d) of the 
previous proposition we obtain 


k = gcd(k", k) = ged(ab, k) = gced(a, k) - gcd(b, k) = uv, 
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so that 
ab=k® =u"v". 


Now, u | a and gcd(a, b) = 1 give us (cf. item (d) of the former Proposition) 
gcd(u, b) = 1; analogously, gcd(v, a) = 1. It thus follows from item (a) that 


gcd(u",b) = 1 and gced(v", a) = 1. 
Finally, 


u" |ab and ged(u",b)=1>u"\|a>u" <a 
v" |ab and ged(v",a) =1 Su" |bau" <b 


However, since ab = u"v", u"” <a and v" < b, the only possibility is to have 
a=u" andb=v". 
| 


We are now in position to prove a result which was already used in [8]. We 
present it as the coming 


Example 6.24 Given natural numbers n and k, with k > 1, either there exists m € N 
such that n = m* or 4/n is an irrational number. 


Proof Assume that V/n = oo with m and p relatively prime naturals (cf. 
Example 6.19). Then, p‘n = m*, and condition ged(m, p) = 1, together with 
the results of the previous corollary, gives gcd(m*, p*) = 1. On the other hand, 
p*n = m* assures that p* | m*. Therefore, the only way of p* and m* being 
relatively prime is that p* = 1. However, since k > 1, we must then have p = 1, so 


thatn = m*. oO 


Roughly speaking (for now, any attempt of giving a more precise definition 
would be useless), an equation in (more than one) integer variables is said to be 
a diophantine equation.* The analysis of general diophantine equation equations 
is generally a very difficult task, demanding quite sophisticated arguments to be 
accomplished. We shall meet (and solve) several important examples of Diophantine 
equations along the way. For the time being, let us take a look at a simple example. 


Example 6.25 Prove that there does not exist x, y € N such that x7+3 = 4y(y+1). 


Proof On the contrary, assume that x? + 3 = 4y(y + 1) for some x, y € N. Then, 


744=4y(yt+)D4+1=Q2y4+1) 


4 After Diophantus of Alexandria, Greek mathematician of the third century BC, who is considered 
to be one of the founding fathers of Algebra and Number Theory. In his famous book Aritmetica, 
Diophantus tried to systematically study integer solutions of certain particular kinds of polynomial 
equations. These, in turn, passed to be deservedly known as diophantine. 
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and, hence, 
3 2 2. 
xv = Qy+ 1) —2° = 2y— 1)(@y + 3). 


Now, if d = gcd(2y — 1, 2y +3), then d is odd and divides (2y +3) —(2y —1) = 4, 
so that d = 1. Therefore, the product of the relatively prime integers 2y — | and 
2y + 3 is a perfect cube, and Corollary 6.23 assures that 2y — 1 and 2y + 3 must 
themselves be perfect cubes. However, one cannot find two perfect cubes whose 
difference is equal to 4, so that we have reached a contradiction. Oo 


In spite of the ad hoc character of the solution of the previous example, the theory 
we have developed so far, on the gcd of two integers, allows us to completely solve 
the diophantine linear equation in two variables ax + by = c. Here, x and y are 
integer unknowns and a, b and c are given nonzero integer parameters. 


Proposition 6.26 Leta, b and c be nonzero given integers. The equation ax + by = 
c admits integer solutions if and only if gcd(a, b) | c. Moreover, if this is so and 
d = gcd(a, b) and x = xo, y = yo is some integer solution, then the formulas 


ie a (6.6) 
x=xo+-—t, y=yo——t, : 
ora Y= Yo F 

t € Z, give all possible integer solutions. In particular, ifab > 0 and x, y € Zis an 
integer solution, then we can assume thatx >0 > yorx <O<y. 


Proof Firstly, suppose that there exist x, y € Z for which ax + by = c. Since d | a 
and d | b, it follows that d | (ax + by), i.e., d | c. Conversely, let c = de, with 
e € Z, and (by Bézout’s theorem) u and v be integers such that d = au + bv. Then, 
c=a-eu+b- ev, so that the given equation admits the integer solution x9 = eu, 
yo = ev. 

For the second part, assume again that d | c, and let x = x0, y = yo be some 
integer solution of the given equation. If x = x1, y = yj is another integer solution, 
we must have a(x; — xo) = b(yo — y1); cancelling d out of both sides of such an 
equality, we obtain 


a i 
qo x0) = =(0 — y1)- 


Thus, 5 | § (x 1 —Xo) and, since gcd (5, 5) = |, item (a) of the previous proposition 


guarantees that E | (x1 — xo). Letting x; — x9 = Fr, we get yw — YW) = at, and 
the stated formulas easily follow. Conversely, it is immediate to verify that such 
formulas do provide integer solutions to the given equation. 

For what is left, assume that a, b > 0 (the remaining case a, b < 0 can be dealt 
with in an analogous way), so that 


b d d 
ide ph es and we—Pebers =": 
d b d a 
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Therefore, by choosing t € Z such that t > — #0, a0 and letting x} = x9 + bt 


and yy = yo — St, we have x1 > 0, yi < 0 and ax; + by; = c. Analogously, 
one can show that we can take an integer solution x, y of the given equation with 
x<O<y. Oo 


Up to this moment, we only know that one can write the gcd of two nonzero 
integers a and D as a linear combination au + bv, for some u, v € Z. However, we 
still do not have at our disposal a method of effectively doing this. To remedy such a 
situation, we consider the following algorithm, usually attributed to Euclid himself. 


Euclid’s Algorithm 
Step 1 la = bay+rn O<r<b 
Step 2 b= ngatra 0<m<r 
Step 3 ry = nqgt+ry 0<r3<r2 
Step J DVj-2 = Vj-19j + rj 0< rj <Tj-1 


Step j+1:rj-1 rjqjti +0 


Note that the execution of the above algorithm actually stops after a finite number 
of steps, for, since 71, 72, ... are integers satisfying b > r) > r2 > --- > O, there 
must exist a least index j such that r; is the last nonzero remainder in the sequence 
of divisions above. 

The importance of Euclid’s algorithm relies upon the following result. 


Proposition 6.27 In the notations above, we have gcd(a, b) = rj. 


Proof Several applications of item (b) of Proposition 6.22 give 


ged(a, b) = gced(a a bq, b) = ged(r), b) 
= gced(r}, b — r1qz2) = ged("1, r2) 
= gced(r} — r2q3, r2) = gcd(73, r2) 


= ged(rj-1,7j) = rj, 
where, in the last equality above, we used the fact that r; | rj—1. Oo 


Euclid’s algorithm provides a method for effectively finding a pair of integers u 
and v whose existence is assured by Bézout’s theorem, i.e., such that gcd(a, b) = 
au + bv. Let us see how to do this by looking at a numerical 


Example 6.28 Use Euclid’s algorithm to show that gcd(120, 84) = 12. Then, solve 
the linear diophantine equation 120x + 84y = 12. 
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Solution We start by executing Euclid’s algorithm: 


120 = 84-1+4 36 
84 = 36-24 12 
36 = 12-3. 
Since 12 is the last nonzero remainder, it follows from Proposition 6.27 that 
gcd(120, 84) = 12. 
We can now find integers u and v such that 12 = 120u + 84v by working 
backwards with the successive divisions that led us to the ged: 
12 = 84-.1—36-2 
= 84-1—(120— 84-1)-2 
= 84-1-—120-24 84-2 
= 84-34 120(—2). 
Finally, with the solution x9 = —2, yo = 3 of 120x + 84y = 12 in our hands, we 
can get all solutions by applying formulas (6.6): 
xX = —2+4 (84/12)t = —24 Tt; 
y =3 — (120/12)t = 3 — 10f. 
oO 


Euclid’s algorithm and the proof of its correctedness (i.e., Proposition 6.27) are 
also useful for theoretical purposes, as shown by the coming example. 


Example 6.29 Let a,m and n be given natural numbers, witha > 1. Ifq,r € Zare 
such that m = ng +r, withO <r <n, prove that 


gcd(a” — 1,a" —1) = gecdim.n) _ 4 


Proof Let us start by showing that, if r = 0, then (a” — 1) | (a’” — 1). To this end, 
it suffices to observe that a” — 1 = (a”)% — 1, and to recall that we already know, 
from Example 6.3, that a” — 1 divides (a”)? — 1. 

We now prove that, if r > 0, then gcd(a” — 1, a” — 1) = ged(a” — 1, a’ — 1). 
Indeed, letting a” = b whenever convenient, we get 


a” —1 =a™*" —1 =(a4% —- la’ + (a -1) 
= ((a")4 — Da" + @ —1) 
= (a" —1)(b- 1 +---+b4 1a" + (a — 1). 
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Letting 
c= (bt! +b? +---4+b4+ 1a" 
d =gcd(a" — 1, a” — 1) 
d’ = gcd(a" — 1,a" — 1) 

we get 


a” —~1l=(a"—-1)c+(a’ —-1). 
Hence, item (b) of Proposition 6.22 gives 


gcd(a” — 1,a" — 1) = ged((a” — 1)e + (a’ — 1), a" — 1) 
= gced(a’ — 1, a” — 1). 
For what is left to do, assume, without loss of generality, that m > n. Ifm =n, 


we are done. Assume, then, that m > n, and consider Euclid’s algorithm for m 
and n: 


m =nqgtni O<ri <7; 
n =riqotr2 0<1r <r}; 
ro =nggt+r3 0<7r3 <1; 


rj-2 =Vj-14j Tj 0< rj <Tj-13 
rj-1 =Vjqjtit 0. 


Our previous discussion guarantee that 
gced(m, n) = ged(n, r}) = ged(r, 172) = +++ = ged(rj-1, rj) =1;- 
Therefore, by applying the above reasoning several times, we get 


gcd(a” — 1,a" — 1) = gced(a” — 1, a"! — 1) 
= gced(a"! — 1, a’? — 1) 


= ged(a"i-! — 1, a") — 1) 
=qdi-l= qed nT 
oO 


We finish this section by studying the least common multiple of a finite set 
of nonzero integers. Given nonzero integers a), a2, ..., Gn, the positive integer 
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|aja2...d,| is acommon multiple of a1, a2, ..., dn. Therefore, there exists a least 
positive integer which is a common multiple of a), a2, ..., dy, and this gives sense 
to the following 


Definition 6.30 Given nonzero integers a1, a2, ..., dn, the least common multiple 
of aj, a2, ...,An, denoted Icm (a1, a2, ..., Gn), is the smallest positive integer which 
is acommon multiple of all of a), a2, ..., dy. 


The coming results establish the basic properties of the lcm. 


Proposition 6.31 Let aj, a2, ..., Gn be nonzero integers with |cm equal to m. An 
integer M is a common multiple of aj, a2, ..., An if and only ifm | M. 


Proof Let M be acommon multiple of a1, a2, ..., d, and write M = mq +r, with 
q,r € Zand0 <r <™m. Since a | m and aq, | M, it follows from Proposition 6.5 
that a; | (M — mq), i.e., a, | r; analogously, a2,...,d, |r, so thatO <r <misa 
common multiple of a1, a2, ..., dn. However, since m is the least (positive) common 
multiple of a1, a2, ..., ay, the only left possibility is thatr = 0. Hence,m|M. O 


Lemma 6.32 Let aj, a2, ..., Gn be nonzero integers and k € N. Then, 
Icm (ka,, kaz,..., kan) =k - lem (aq, a2, ..., dy). 


Proof If M = \em (kay, kao, ..., kay) andm = |Icm (aj, a2,..., dy), thenkm isa 
positive common multiple of kaj, kaz,..., kay, so that km > M. On the other hand, 
since M is a (positive) common multiple of ka,, kaz,..., kay, it follows that k | M 
and M/k is a positive common multiple of a1, a2, ..., dn; therefore, M/k > m or, 
which is the same, M > km. We conclude that M = km, as desired. oO 


Our next result relates the gcd and the lcm of two nonzero integers. 


Proposition 6.33 If a and b are nonzero integers, then 
gced(a, b) - lem(a, b) = |ab|. 


Proof We shall begin by showing that gcd(a, b) = 1 => Icm(a, b) = |ab|. To this 
end, let m be a positive common multiple of a and b. Since gcd(a, b) = 1, item 
(e) of Proposition 6.22 guarantees that ab | m. Hence, m > |ab|, so that |ab| is the 
smallest positive common multiple of a and b, i.e., |ab| = lcm (a, b). 

For the general case, first note that lcm (a, —b) = |cm (a, b); since we already 
have gcd(a, —b) = gcd(a, b), we can assume, without loss of generality, that a, b > 
0. Let d = gcd(a, b) and write a = du and b = dv, so that u and v are positive and 
relatively prime integers. We wish to show that lcm (du, dv)d = d7uv or, by the 
previous lemma, that lcm (u, v) = uv. But this is exactly what we have done in the 
paragraph above. oO 


Example 6.34 (Japan) Find all pairs (a, b) of positive integers such that a > b and 


Icm (a, b) + gced(a, b) +a+b=ab. 
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Solution Set d = gcd(a, b), and let u, v € N be such that a = du and b = dv. 
Then, a > b> u > v and 


d- |Icm(a, b) = gcd(a, b)- Iem(a, b) = ab = d?uv, 


so that lem (a, b) = duv. Substituting these expressions in the original equation, 
we conclude that it is equivalent to 


u(v+1)+(v4+1) =dur, (6.7) 
so that u | (v + 1). Thus, uw < v + | and, therefore, vu = voru=v+1.Ifu=v, 
it follows from gcd(u, v) = | thatu = v = 1, sothatd = 4 anda =b= 4. If 


u = v-+ 1, it follows from (6.7) that 


_¥@tD+@t) _ (v+1)?+(v4+1) 


d 
uv (v+ 1)v 
v2 +3v+2 2 
— 5 =1+ 5 
vr +v 1) 


which in turn implies v = 1 or v = 2. By separately examining each of these cases, 
we arrive at the remaining solutions: a = 6, b = 3 ora = 6,b = 4. oO 


Problems: Sect. 6.2 


1. (IMO) Forn € N, prove that ged(21n + 4, 14n + 3) = 1. 

2. (OIM) Find all positive integers m and n such that 2” + 1 = n?. 

3. Consider two infinite and nonconstant arithmetic progressions whose terms are 
positive integers. Prove that there exist infinitely many naturals which are terms 
of both of these sequences if and only if the gcd of their common differences 
divides the difference of their first terms. 

. * Givena,m,n €N, with m # n, prove that gcd(a?" +1, go 1) =1or2. 

. (Japan) Prove that gcd(m! + 1, (1 + 1)!+ 1) = 1, for every n €N. 

. (TT) If a and b are natural numbers for which ab | (a? + b?), show that a = b. 

. (Brazil) We partition an a x b rectangle (a,b € N) into ab squares 1 x 1. 
Prove that each diagonal of the rectangle crosses the interior of exactly a + b— 
gcd(a, b) squares | x 1. 

8. Let n and k be positive integers, with n > k. Prove that the gcd of the binomial 


numbers 
n n+1 n+k 
ky? k os k 


NAN 


equals 1. 
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10. 


11. 


12. 


13. 


14. 


15. 


16. 


17. 


18. 
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(Miklés-Schweitzer) Let n > 1 be a given integer and 2 be the greatest power 
of 2 that divides it. Prove that the gcd of the binomial numbers 


2n 2n 2n 2n 
ie 20) geese 
is equal to 2*+!, 


(USA—adapted) For a fixed k € N, show that 
max{ged(n? +k, (n+ 1)? +k); ne N} =4k4+1. 


(Hungary) If a, b and c are positive integers such that , + ; — i, prove that 
there exist positive integers g, u and v for which gcd(u, v) = | anda = qu(u+ 
v),b=qvu(u+v) andc = quv. 

(England) Let x and y be natural numbers such that 2xy divides x? + y? — x. 
Prove that x is a perfect square. 

(USA) Of any set of ten consecutive natural numbers, show that one can always 
choose at least one element which is relatively prime to all of the nine remaining 
ones. 

Let a, b and m be given naturals, with gcd(a, m) = 1. Show that 


m—| F 
> | + = (a 1)(m —1) +b. 


; m 
j=0 


For the coming problem, recall (cf. [8], for instance) that the fractionary 
part of x € R is the real number {x} such that {x} = x— |x]. Thus, {x} € [0, 1) 
and {x} =O Sx eZ. 

(Japan) Let n, r € N be such thatn > | andn {r. If d = gcd(n,r), prove that 


n} 2 : 


i=l 


(Putnam) Given m,n € N, with m > n, prove that 


gcd(m, n) (") cN. 


m n 


(Bulgaria) Let (ay)n>1 be the sequence of positive integers defined by aj = 2 
and an41 = a? — a, +1 forn € N. Show that any two terms of this sequence 
are relatively prime. 

* As usual, let (Fi,)n>1 be the Fibonacci sequence, i.e., the sequence given by 
F, = Fy = 1 and Fyyi2 = Fey, + Fy for every k € N. Do the following 
items: 


6.2 


19. 


20. 


21. 


22: 
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(a) Prove that gcd( Fy, Fn+1) = 1 for everyn € N. 
(b) Prove that, for alln, k € N, withn > 1, one has Fy4x = Fn—1 Fe + Fh Fry. 
(c) Use item (b) to successively conclude that, for alln,g,r € N: 


G@) ged(Fng, Fn) = Fa. 
(ii) ged(Prg-1, Pn) = 1. 
(iii) gcd(Frg+r, Fn) = gcd(F,, Fn). 


(d) Show that gcd(Fy, Fin) = Fecam,n) for all m,n € N. 


(Croatia) Let (ay )n>1 be a sequence of integers such that aj = 1 and ay,42 = 
a24n+1 + Gn for every n € N. Prove that gcd(am, Gn) = Agcaim,n)- 

Given relatively prime natural numbers a and b, establish the following 
results: 


(a) Every natural number n > ab can be written asn = ax+by, withx, y EN. 

(b) The natural number ab cannot be written as in (a). 

(c) Every integer n > ab — a — bcan be written in the form n = ax + by, for 
some nonnegative integers x and y. 

(d) The integer ab — a — b cannot be written as in (c). 

(e) There are exactly x(a — 1)( — 1) nonnegative integers which cannot be 
written in the form ax + by, for some nonnegative integers x and y. 


(IMO—adapted) Let a, b and c be pairwise relatively prime natural numbers. 


(a) Show that there do not exist x, y, z € Z4 such that 2abc — ab — bc —ca = 
xbe + yac + zab. 

(b) Ifn € N satisfies n > abc — a — bc, use the result of the previous problem 
to establish the existence of x,t € Z, such thatn = xbc + ta, withO < 
x<a-l. 

(c) Show that the integer t of item (b) is greater than bc — b — c, and use 
the result of the previous problem once again to establish the existence of 
y,z € Z, for which t = bz + cy. 

(d) Ifn € N satisfies n > 2abc — ab — bc — ca, use the results of items (b) and 
(c) to show that there exist x, y, z € Z, such that n = xbc + yac + zab. 


(IMO—shortlist) Generalize the result of the previous problem in the following 
way: if a), a2, ..., a, are pairwise relatively prime natural numbers and 


(here, aj indicates that a; is not present in the product x ja, .. a; ... Gy), then 
n 1 n 

n-1- — aj 

i= i= 


is the greatest natural number which does not belong to A. 
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23. It is interesting to ask how efficient Euclid’s algorithm is, upon computing the 
gcd of two nonzero integers. The purpose of this problem is to shed light on this 
question, by proving a result of Lamé.° To this end, do the following items: 


(a) If Fx is the k-th Fibonacci number and n € N, prove that Fy45 > 10F). 
Then, deduce that F5,42 > 10”, so that the decimal representation of F5;+2 
has at least n + 1 digits. 

(b) Given natural numbers a > b, assume that Euclid’s algorithm takes n steps 
to compute gcd(a, b). Prove that b > Fy_-1. 

(c) Prove Lamé’s theorem: given natural numbers a > b, the number of steps 
needed in Euclid’s algorithm to compute gcd(a, b) is at most five times the 
number of decimal digits of b. 


24. (Brazil) Letn > 1| be an integer. Prove that 


hoes ie 
2 3 n 


is never an integer. 
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An integer p > 1 is prime if its only positive divisors are 1 and p. An integer 
a > | which is not prime is said to be composite. We shall prove in a few moments 
(cf. Theorem 6.38) that the set of primes is infinite. For the time being, and for the 
reader’s convenience, we list below all prime numbers less than 50; their primalities 
can be checked directly and without difficulty (see, also, the discussion below on the 
Eratosthenes’ sieve): 


2,3,5,7, 11, 13, 17, 19, 23, 29, 31, 37, 41, 43, 47. 


Our purpose in this section is to study the elementary properties of prime 
numbers, paying special attention to their relation to composite numbers. We start 
with the following auxiliary result, which is known as Euclid’s lemma. 


Lemma 6.35 (Euclid) Every integer n > 1 can be written as a product of finitely 
many primes, not necessarily distinct.® 


Proof Let us make induction on n. For n = 2 there is nothing to do, for 2 is prime. 
Now, let m > 2 be a given integer and assume that every integer n such that 2 < 


5 After Gabriel Lamé, French mathematician of the nineteenth century. 
Here, for the sake of simplifying the writing, we say that a prime is a product of one prime. This 
is in accordance with the general definition of ITj- 1 @;, for which ae aj =a. 
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n < mcan be written as a product of finitely many primes; we shall prove that the 
same is true for m itself, and to this end we distinguish two distinct cases: 


(i) if m is prime, there is nothing to do. 

(ii) If m is not prime, there exist integers a and b such that m = ab, with 1 < 
a, b < m. By induction hypothesis, each of a and b can be written as a product 
of finitely many primes, say a = p,... py andb = q, ---q, withk,/] > 1 and 
Pi,--+, Pk» Q1,--->q1 primes. Therefore, 


m=ab= p,...Peqi-:- WU, 


so that m is also a product of finitely many primes. 
oO 


A useful consequence of the previous lemma is the following criterion for 
searching prime divisors of a composite numbers, due to the Greek mathematician 
of the third century BC Eratosthenes of Cyrene.’ 


Corollary 6.36 (Eratosthenes) Jf an integer n > 1 is composite, then n has a 
prime divisor p such that p < ./n. 


Proof Since n is composite, we can write n = ab, with a and b integers such that 
1 <a < b. Letting p be a prime divisor of a (which exists, by Euclid’s lemma), it 
follows that p | n and 


Therefore, p < /n. Oo 


The previous corollary can be successfully used to establish the primality of a 
small natural number, as shown by the coming example. The method involved is 
known in mathematical literature as Eratosthenes’ sieve, and consists in dividing 
the natural number at scrutiny by each of the primes which are less that its square 
root. According to Eratosthenes’ theorem, if none of such divisions is exact, then 
the number is prime. 


Example 6.37 Use the previous result to prove that 641 is prime. 


Proof Firstly, note that 25 < 641 < 26. Hence, if 641 is composite, then 
Eratosthenes theorem guarantees that 641 must possess a prime divisor p < 25, 
so that 


p € {2,3,5, 7, 11, 13, 17, 19, 23}. 


7 Apart from his work in Number Theory, another important achievement of Eratosthenes was an 
indirect measurement of the Earth’s diameter, with an impressive degree of accuracy for his time. 


182 6 Divisibility 


Nevertheless, it is immediate to verify that, upon dividing 641 by each of the primes 
above, no division is exact (i.e., leaves zero remainder). Therefore, 641 is indeed a 
prime number. Oo 


A slight modification of the procedure above can be used to find prime numbers 
recursively. For example, imagine one wants to find all prime numbers less than or 
equal to 150. One starts by erasing, from the list of odd numbers from 3 to 150, those 
which are multiples of 3. After that, the least number greater than 3 and not erased 
is 5, which is then prime; thus, we perform a second step, by erasing all multiples of 
5. This step leaves 7 as the least number not erased, so that 7 is prime; in the third 
step, we then erase all multiples of 7. Continuing to proceed this way, we get the 
following list as result: 


3 5 7 ¥ 11 13 KB 17 19 2W 23 2B 2 29 
31 3% 3S 37 BF 41 43 45 47 AS ST 53 5S 57 
59 61 63 6 67 6Y 71 73 BK HW 719 S 83 & 
BT 89 9 9% OF 97 OF 101 103.105 107 109 141 113 
MS UT U9 12T 125 125 127 129 131 133 135 137 139 147 
143 145 147 149 151 153 155 157 159 16T 163 165 167 169 


The numbers which were not erased are prime. 

In spite of the above, the importance of Eratosthenes’ sieve lies on a theoretical, 
rather than practical, level. For instance, in order to decide whether or not 999997 is 
prime, we would have to divide it by all prime numbers less than “999997 = 1000, 
which is not feasible without the aid of computer programming.® 

Back to Euclid’s Lemma 6.35, if we gather together equal primes we conclude 
that every integer n > 1 can be write in the formn = py’... py‘, with pi, ..., Pk 
pairwise distinct primes and a, ...,a@% € N. We shall see in Theorem 6.43 that, up 
to a permutation of factors, such a way of writing n is unique. However, before we 
do that, let us prove another result of Euclid, which shows that the set of primes is 
infinite. 


Theorem 6.38 (Euclid) The set of prime numbers is infinite. 
Proof By induction on n, let us prove that if N contains n primes, then it contains 


n+ 1 primes. Suppose, then, that p1,..., Py are pairwise distinct primes, and let 


m=p.--Prtl. 


8 Actually, the situation here is even more dramatic. With a little more work, it is possible to 
show that Eratosthenes’ sieve is not a polynomial algorithm. This means that the quantity of 
arithmetic operations one needs to perform in order to compute the n-th prime with the aid of it 
grows exponentially with n. Thus, even with the aid of a powerful computer, Eratosthenes’ sieve 
is not a feasible primality algorithm for large numbers. 
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By Euclid’s Lemma 6.35, there exists a prime p such that p | m. If p = p; for 
some | <i <n, then p| p1... py. It follows from Proposition 6.5 that p divides 
the difference m — p,... Pn, i.e., p | 1, which is an absurd. Therefore, p is a prime 
number distinct from all of the p;’s, so that we have at leastn + 1 primesinN. O 


Remark 6.39 In spite of the above result, there are arbitrarily large gaps in the 
sequence of primes. Indeed, given an integer k > 3, for all of the consecutive and 
positive integers a + 2,a+ 3,...,a +k to be composite it suffices that a be a 
common multiple of 2, 3, ..., k. We shall have more to say on the distribution of 
primes along N in Chap. 9. 


As the next example shows, small variations of the argument used in the proof 
of Euclid’s theorem allow us to prove that several arithmetic progressions contain 
infinitely many primes among their terms.” 


Example 6.40 Prove that there are infinitely many primes of the form 4k — 1. 


Proof Assume that there is only a finite number of primes of the form 4k — 1, say 
Pi = 3, p2 =7, p3 = 11,..., p;:, and consider the number 


m=4p,p2...p;r—1. 


Then, m > | and, letting m’ = pi po... py, we have m = 4m’ — 1. 

On the other hand, Lemma 6.35 guarantees the existence of odd primes q1, ..., gi 
such that m = q1 ...qi. Now, observe that every odd prime is of one of the forms 
4q' — 1 or 4q' + 1, for some q' € Z. If qi = 4q; + 1 for 1 <i < t, we would get 


m=q...q = (4q,+1)...(44¢) +) =4¢ +1, 


for some g € N. But this would contradict the fact that m = 4m’ — 1. Hence, there 
exists at least one index 1 <i </ such that g; = 4q} — 1. 


Finally, since pi, p2,..-, p; are all primes of the form 4k — 1, we must have 
qi = pj for some | < j < t. However, since g; | m, we conclude that p; should be 
a divisor of m = 4p; p2... p: — 1, which is a contradiction. oO 


For the proof of Theorem 6.43 we shall need the following auxiliary result, which 
is important in itself. 


Lemma 6.41 If a,,...,a, € N and p is a prime such that p | ayaz...@n, then 
there exists | < i <n such that p | aj. In particular, if aj, ..., dy are also primes, 
then there exists 1 <i <n such that p = qj. 


° Actually, it is possible to prove a much more general result, due to the Dirichlet, which states that 
every infinite and nonconstant arithmetic progression of naturals contains infinitely many primes, 
provided its first term and its common difference are relatively prime. We shall see a relevant 
particular case of Dirichlet’s theorem at Sect. 20.2. 
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Proof We shall prove the case n = 2, the general case being totally analogous. 
Assume that p | (ab) but p { a, and let d = gcd(a, p). Since d | p, we have d = 1 
or d = p; however, since p { a andd | a, we conclude that d # p, and hence d = 1. 
Thus, since p | (ab) and gcd(a, p) = 1, it follows from item (a) of Proposition 6.22 


that p | b. 
The particular case follows at once from the first part, together with the very 
definition of prime number. Oo 


Example 6.42 If p is a prime number, then p divides each of the binomial numbers 
(2), for every integer 1 <k < p—1. 


Proof We can restrict to the case of an odd p. Since 


Ge Pp! (k+1)...(p— Dp 
kk) k\p—k! (p —k)! 


is an integer, we have that (p — k)! divides (k + 1)...(p — 1)p. Now, since p is 
prime and p { 1,2,..., p — k, the previous lemma guarantees that p { (p — k)!, 
and hence gcd(p, (p — k)!) = 1. Therefore, item (a) of Proposition 6.22 shows that 
(p — k)! must divide (k + 1)... (p — 1). In turn, this fact allows us to write 


(le &+1)...~-)) 
kk} (p —k)! 
eN 


’ 


so that (?’) is a multiple of p. Oo 


The coming result is the cornerstone of elementary Number Theory, and is 
known as the Fundamental Theorem of Arithmetic. Whenever convenient, we 
shall abbreviate it as FTA. 


Theorem 6.43 Every integer n > 1 can be written as a product of powers of 
distinct primes. Moreover, such a decomposition of n is unique in the following 
sense: ifn = By on = qi wet 4 with py <--- < peandq, <---<q 
being primes and a;, Bj = 1 being integers, thenk = | and pj = qi, a = Bi for 
1<i<k. 


Proof The existence was established in the paragraph preceding Theorem 6.38. 

For the uniqueness, let the integer n > 1 have two decompositions as in the 
statement. Since p; | n, we have that pj | qe : ate "and the previous lemma 
guarantees the existence of an index 1 < j < // such that pj = qj. On the other 
hand, since q; | n, we have that q | Py! or | ae again by the previous lemma, there 
exists an index 1 <i < k such that gq; = p;. Thus, pj = gj; > qi = pi = pi, from 
where we get p1 = q and, hence, 


Rap By eee = Pe Gy sa 
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We shall now prove that a; = 6;. By contradiction, assume that a; 4 6,. We have 
two different cases to consider: 


G@) a =< pe then py ...pe = Oe Aa cet so that pj | ps’... py". 


Arguing as above, there would exist an index 2 < i < k such that py = pj, 
which is an absurd. 

(ii) a > Bi: then ne. ont, = ge? - ge Again as above, we would get 
Pi = qj; for some index 2 < j < /, and this would give us the contradiction 
Pli=4j > 41 = Pl. 


Thus, a; = f1, and it follows that p5?... py" = ae se qe. 


Repeating the above reasoning several times (there is an induction here, which 
shall not be formalized for the sake of clearness), we successively conclude that 
P2 = q2 and a2 = fo, p3 = q3 and a3 = #3, etc. At the end, if k < 1 we would get 


l= af a ar ah ' which is clearly impossible. Likewise, k > | is also impossible, 
so that k = / and there is nothing left to do. Oo 


The FTA will be almost omnipresent along these notes. However, it is instructive 
to the reader that we pause for a moment to illustrate how one can profitably use 
such a result in concrete situations. 


Example 6.44 The natural numbers x and y are such that 3x2 +x = 4y? + y. Prove 
that x — y is a perfect square. 


Proof Let p be a prime number and p“, p? and p° be the greatest powers of p 
dividing x, y and x — y, respectively (here we are already invoking the FTA). Let us 
momentarily assume that a < b. Then, p-4 | x2, iy, and it follows from item (c) of 
Proposition 6.5 that p*“ | (4y? — 3x7). Since x — y = 4y* — 3x”, we then have that 
p~* | (x — y), and the uniqueness part of the FTA gives c > 2a. On the other hand, 
writing 


x* = (x-y) - 40? — x2) = —- yy) +407 +9) 


and recalling the way we chose c, we conclude that p° | x7. Therefore, once more 
from the uniqueness part of the FTA, we obtain c < 2a. Hence, c = 2a, which is an 
even number. If a > b we conclude, quite similarly, that c = 2b. Finally, since the 
prime p was arbitrarily chosen, the FTA presents x — y as the product of powers of 
primes with even exponents. Therefore, x — y is a perfect square. oO 


The presentation of an integer n > 1 as the product of powers of distinct primes 
is said to be its canonical factorisation or decomposition in prime powers. The 
following corollaries collect some useful consequences of the existence of such a 
factorisation. 


Corollary 6.45 Let a, b, m and n be natural numbers for which a" = b". If 
gcd(m, n) = 1, then there exists c € N such thata = c" andb =c". 
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Proof It is clear that a and b are divisible by exactly the same primes, say p; < 


. < pe. Leta = p™... pt and b = pf... pe, with aj, Bj > 1, be the 


canonical decompositions of a and b. Then, 


ma, mak om _ pn _ np, nBx 
Pi Py Ha =D = py... Py 


and the uniqueness part of the FTA gives 
ma; =npj, V 1<i<k. (6.8) 


Thus, m | nf; and, since m and n are relatively prime, item (a) of Proposition 6.22 
allows us to conclude that m | 6;. Letting u; € N be such that 6; = mu;, (6.8) gives 


a; = nu;. Therefore, letting c = p}' ... py, we obtain 
=P nap, Se Ghd be 7 a, ee 


oO 


Let us now learn how to obtain the canonical decompositions of the positive 
divisors of an integer greater than | and the gcd and Icm of two nonzero integers. 


Corollary 6.46 [fn = pe ee pe is the canonical decomposition of the integer 
n > 1, then the positive divisors of n are the natural numbers of the form 
te sah with 0 < a; < B; for 1 <i < k. In particular, letting d(n) denote 
the number of positive divisors of n, then 


k 
din) =| [6 +0. (6.9) 
i=1 


Proof \fd > 1 is a divisor of n and p is a prime that divides d, then p also divides 
n, so that p is equal to one of the primes p;,..., px. However, since this is valid 
for every prime divisor of d, it follows that d = o. Sirs pe , with a; > O for every 
i (we not necessarily have a; > 1, for it may well happen that p; { d for one or 
more values of i). Now, if d’ € N is such that n = dd’, then, as above, we get 


tg 


d= Pi! = P,. with a! > 0 for every i. Therefore, 


a+) aay, 


P| ++ DE =dd' =n= pi... ph, 


and the uniqueness part of the FTA gives a; + o; = B; for every i. In particular, 
a; < P; for every i. 

Conversely, it is immediate to show that every natural number d of the form 
ae ar Des with 0 < a; < B; for 1 <i <k, is a positive divisor of n. 

For what is left to do, what we did above, again in conjunction with the 
uniqueness part of the FTA, assures that the number of positive divisors of n 
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coincides with the number of sequences (a1, ..., @%) of integers such that 0 < a; < 
6; for 1 <i <k. Since there are 8; + | possibilities for a;, formula (6.9) follows at 
once from the fundamental principle of counting (cf. Corollary 1.9). oO 


The following example presents a simple application of the former result. 


Example 6.47 Prove that the natural number n is a perfect square if and only if d(n) 
is odd. 


Proof Firstly, since 1 = 1? and d(1) = 1 is odd, we can assume that n > 1. Then, 
let > 1 be an integer and n = pe re oS be its canonical factorisation into prime 
powers. 

If n is a perfect square, say n = m7, with m € N, then m | n, so that m = 


py .-- py, with 0 < a; < fj for 1 <i < k. Therefore, 


: 2 2 
py eB =n=m Ep ede 
and the uniqueness part of the FTA gives 6; = 2a; for 1 < i < k. Thus, the previous 
corollary gives 


d(n) = (6, +1)... (Be +1) = Qa + 1)... ax + 1), 


which is and odd number. 
Conversely, let d(n) = (8; + 1)...(6,% + 1) be odd. Then, 6; + 1 is odd for 
1 <i <k, which in turn gives 6; even for 1 <i <k. Writing 6; = 2a;, we get 


_ 201 2a (ay a\2 2 
N= py -++PE = (Pp; ne) =m’, 


Witt = py ane Dy o 

For the next corollary, it is useful to extend the canonical decomposition of an 
integer greater than | by allowing exponents equal to 0. For instance, we can write 
48 = 24.3 and 270 = 2-3°-5 by using the prime numbers 2, 3 and 5 in both cases, 
so that 


48 = 2*.3.5° and 270 = 2.33.5. 


Corollary 6.48 Let a,b > 1 be natural numbers such that a = ae bee a and 


b= pi ... Pe’, with Pi <°+++ < pe prime numbers and aj, Bj = 0 for 1 <i <k. 


k k 
gcd(a, b) = I] fe and \cm(a, b) = I] pee 
i=l i=l 
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Proof Letus do the proof for the gcd of a and b (that for the lcm of a and bis totally 
analogous). Since min{a;, 6;} < a;, 6; for every i, the number d = The gre 
divides both a and b. Now, let d’ be a common positive divisor of a and b. 
Corollary 6.46 gives d’ = oe he pr with y; > O for every i; moreover, since, 
d' | a, b, it also gives y; < a;, B; for every i. Hence, we have y; < min {q;, 6;} for 
every i, so that (once more from Corollary 6.46) d’ | d. Therefore, d = gcd(a, b). 
Oo 


Back to the last paragraph before the statement of the above corollary, we can 
now readily compute 


gcd(48, 270) = 2-3 =6 and Icm(48, 270) = 2*- 33.5 = 2160. 


For what comes next, if a natural number n and a prime number p are such 
that p < n, then we obviously have that p | n!. The following result, due to 
A-M. Legendre,!° teaches us how to compute the exponent of p in the canonical 
decomposition of n!. Accordingly, (6.10) is known as Legendre’s formula. 


Proposition 6.49 Let n > 1 be natural and p be prime. If ep(n) denotes the 
exponent of p in the canonical factorisation of n!, then 


ep(n) = > Fak (6.10) 


ko LP 
Proof Firstly, note that the above sum is actually finite, for Loe = O whenever 
p* > n. Now, if k € Nand p*, 2p*,..., mp* are the multiples of p* which are less 


than or equal to n, then mp* < n < (m+ 1)p* or, equivalently, m < x <m+l. 


Hence, m is the greatest integer which is less than or equal to nz ie.,m= Lord: In 
turn, this shows that for each integer k > 1 there are exactly 


lL] 

pk pet 

natural numbers which are less than or equal to n and are multiples of p* but not of 
p**!. Each of these naturals contributes with exactly k factors p to e p(n), so that 


m= ¥4( 5 ]-Laee]) ZL 


The two coming examples bring interesting applications of Legendre’s formula. 


Oo 


10 After Adrien-Marie Legendre, French mathematician of the eighteenth and nineteenth centuries. 
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Example 6.50 (Soviet Union) In how many zeros does 1000! end? 


Proof We have to find the greatest power of 10 that divides 1000!. To this end, and 
writing 1000! = 2¢2(1000) . 5¢5(1000) 7 with m € N, it is enough to compute the 
smallest of e2(1000) and e5(1000). It is clear that the smallest of these is e5(1000), 
which can be promptly computed with the aid of Legendre’s formula (and taking 
into account that 5> > 1000): 


4 

1000 

e5(1000) = )> “ay | = 200+40-+8-+ 1 = 249. 
j=l 


Therefore, 1000! ends in 249 zeros. oO 
Example 6.51 (Yugoslavia) Prove that 2” { n!, foreachn € N. 


Proof The result is clearly true forn = 1. Forn > 1, Legendre’s theorem assures 


n 


that the greatest power of 2 dividing n! has exponent k = }> et Eat However, 


|x] < x for every x and Ea = 0 for every sufficiently large 7; therefore, by means 


of the formula for the sum of the terms of a geometric series (cf. Proposition 7.38 
of [8], for instance), we get 


Hence, k <n, so that 2” { n!. Oo 


Problems: Sect. 6.3 


1. Let p and g be two consecutive odd primes. Prove that there exist integers 
a,b,c > 1, not necessarily distinct, such that p + q = abc. 

2. (IMO—adapted) Let k € N be such that p = 3k + 2 is prime. If m and n are 
natural numbers such that 


m 1 1 1 I af 1 
n 2 3 4 2k 2k +1" 


prove that p | m. 
. (Hungary) Given an odd prime p, find all x, y € N such that 2 = + + i. 
4. (IMO) Find all n € N such that n(n + 1)(n+2)(n+3) has exactly three distinct 
prime divisors. 
5. (Russia) Let (a; ),> 1 be a sequence of natural numbers such that gcd(q;, aj) = 
gcd(i, 7) whenever i ~ j. Prove that a, = n for every n > 1. 


iS) 
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6. 


11. 


12. 


13. 


14. 
15. 


16. 


6 Divisibility 


* We say that a natural n is square free ifn = 1 orn = py... px, where 
Pi <°++: < px are prime numbers. In this respect, do the following items: 


(a) Prove that every integer n > 1 can be uniquely written in the formn = ab’, 
with a, b € N and a being square free. 

(b) Use item (a) to show that, if there were exactly k prime numbers, then there 
would exist exactly 2 square free integers and that the number of positive 
integers less than or equal to n would be at most 2* /n. 

(c) Use item (b) to give another proof of the fact that there are infinitely many 
primes. 


. Show that there are infinitely many primes of each of the forms 3k + 2 and 


6k +5. 


. (Yugoslavia) For each n € N, prove that the number 2?" — 1 has at least n 


distinct prime factors. 


. If p is an odd prime, prove!! that p divides ‘es ) —2. 
. * Let a and n be positive integers, with a > 1. Do the following items: 


(a) If a” + 1 is prime, show that n is a power of 2 (we shall see in Problem 7, 
page 252, that the converse is not true).!* 

(b) If > 1 and a” — 1 is prime, show that n is itself prime and a = 2 (the 
converse is not true, as shown by the example 2!! — 1 = 23 - 89). 


(IMO) Find all naturals n > 6 for which the natural numbers less than n and 
relatively prime with n form an arithmetic progression. 
(IMO.) Let k, m and n be naturals such that m+ -+ 1 is a prime number greater 
than n + 1. If cy = s(s + 1) for each s € N, prove that 


(Cm+1 _ Ck) (Cm+2 — Ck)... (Cm-+n — Ck) 


is divisible by c1c2--+ Cn. 

(Hungary) Let n be a given natural. Show that there are exactly d(n) ordered 
pairs (u, v) with u, v € N and such that Icm (uv, v) =n. 

(IMO) Find all natural numbers a and b such that a? = b®. 

(BMO) Find all n € N such that n = dj + d} + dj + dj, where 1 = d) < dp < 
d3 < dg are the smallest positive divisors of n. 

(OIM) For eachn € N, let 1 = d; < --- < dk =n be the positive divisors of 
n. Find all n satisfying the following conditions: 


(a) k> 15. 


(b) n=d13+d14+ ds. 
(c) (ds +1)? = djs +1. 


'lFor a stronger version of this result, see Problem 15, page 310. 


'2Prime numbers of the form 27" + 1 are called Fermat primes. They enter decisively into Gauss’ 
description of which regular polygons can be constructed with straightedge and compass (cf. [20], 
for instance). 


6.3 


17. 


18. 


19. 


20. 


21. 


22. 


23. 


24. 


25. 


26. 
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(Japan)!3 For each integer n > 1, let [(n) be the sum of the greatest odd 
divisors of the numbers 1, 2,..., and T(n) = 1+2+----+n. Prove that there 
exist infinitely many values of n for which 3/(n) = 2T(n). 

(Australia) Find all naturals n for which d(n) = qo where d(n) stands for the 
number of positive divisors of n. 

(IMO) Let m and n be arbitrary nonnegative integers. Prove that 


(2m)!(2n)! 
m!'ni(m +n)! 


is always an integer. 


For each positive integer n, let a, = ). 


(a) Show that the binomial number a, is always even. 
(b) Prove that 4 | a, if and only if 7 is not a power of 2. 


(ORM) We say that a positive divisor d of a natural number n is special if 
d < ./n and there are no perfect squares between d and 17. Prove that every 
natural 1 has at most one special divisor. 

(Romania) Let n be a natural number whose binary representation has exactly 
k digits 1. Prove that 2”~* divides n!. 

Find all natural numbers a, b and k such that a and D are relatively prime and 
(a+ kb)(b + ka) is a power of a prime. 

(Brazil) Show that there exists a set A of positive integers satisfying the 
following conditions: 


(a) A has 1000 elements. 
(b) The sum on any number of distinct elements of A (at least one such 
element) is not a perfect power of exponent greater than 1. 


(Brazil—adapted) For each integer n > 1, let p(n) be the greatest prime divisor 
of n. Do the following items: 


(a) If g is an odd prime, show that we cannot have p(q?) > p(q2 + 1) for 
every integer k > 0. 

(b) If p(q?) < p(q2 + 1) and k is minimum with such a property, show that 
pq? — 1) < pq”). 

(c) Show that there exist infinitely many naturals n satisfying p(n — 1) < 
p(n) < p(n +). 


Let py = 2, p2 = 3, p3 = 5, ... be the sequence of prime numbers. For each 
kK EN, let xy = pip2...pet+iand A = {x1, x2,...}. Fora fixed m € N, prove 
that one can choose a subset B of A satisfying the following properties: 


(a) |B| =m. 

(b) Any two elements of B are relatively prime. 

(c) The sum of the elements of any subset of B having more than one element 
is not prime. 


'3For the converse to this problem, see Problem 5, page 226. 


Chapter 7 ®) 
Diophantine Equations sei 


Our purpose in this chapter is to study some elementary diophantine equations. 
Among these we highlight Pythagoras’ and Pell’s equation, for which we charac- 
terize all solutions. We also present to the reader the important Fermat’s descent 
method, which provides a frequently useful tool for showing that certain diophantine 
equations do not possess nontrivial solutions, in a way to be made precise. 
The aforementioned method is one of the major legacies of Pierre Simon de Fermat 
to Number Theory,! and will be frequently used hereafter. 


7.1 Pythagorean Triples and Fermat’s Descent 


Let us start by studying the nonzero integer solutions of the equation x? + y? = 
22, which, thanks to Pythagoras’ theorem of Euclidean Geometry, is known as 
Pythagoras’ equation. After we find all such solutions, we shall see how to use 
that information to solve other interesting diophantine equations. The fundamental 
result is as follows. 


Proposition 7.1 The 3-tuples (x, y, z) of nonzero integers such that x? + y* = 27 
are given by: 
x = 2uvd x = (u2 —v?)d 
y=(u*—v*)d or { y =2uvd , (7.1) 
z= (u? +07)d z= (u* + v’)d, 


where d, u and v are nonzero integers, with u and v of distinct parities and relatively 
prime. 


‘Other remarkable ones are Fermat’s little theorem, which has already appeared in Problem 5, 
page 57, and will be proved by other methods in Sect. 10.2, and Fermat’s theorem on sums of two 
squares, which will be the object of Sect. 12.4. 
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Proof We can assume, without loss of generality, that x, y, z > 0. Ifd = gcd(x, y), 
then d* | (x? + y7), so that d? | z?; therefore, d | z. Hence, there exist nonzero 
integers a, b and c for which gcd(a, b) = 1 and (x, y, z) = (da, db, dc). Moreover, 
since 


etya2ee24P=e’, 


we conclude that it suffices to find all nonzero integer solutions a, b, c of the above 
equation, subjected to the additional condition gcd(a, b) = 1. 

The requirement that a* + b* = c* and gced(a,b) = 1 easily give us (prove 
this!) gcd(a, c) = gced(b, c) = 1. Now, recall (cf. Corollary 6.8) that the square of 
an integer t leaves remainder 0 or | upon division by 4, according to whether f is 
respectively even or odd. Therefore, if a and b are both odd, then c* = a? + b? will 
leave remainder 2 upon division by 4, which is a contradiction. 

Since gcd(a, b) = 1, we are left with two cases to consider: a is odd and b is 
even, or vice-versa. Let us look at the first case, the analysis of the second one being 
entirely analogous. Since a is odd, b is even and c? = a? + b*, we conclude that c 
is also odd. Now, write 


b* = (c—a)(c +a). (7.2) 
If d’ = gced(c — a,c +a), then d’ divides both 
(c+a)+(c—a)=2c and (c+a) —(c—a) = 2a 


so that d’ | gced(2a, 2c) = 2. However, since c — a and c + a are both even, we 
conclude that d’ = 2, and we can write (7.2) as 


(5) = (54) (44). 


with gcd (S4, ofa) = |. Thus, item (b) of Corollary 6.23 assures the existence of 


relatively prime naturals u and v for which c + a = 2u? and c — a = 2v’, so that 


a=u—v’*, b=2uv, c=u +’. 


Furthermore, since c = u2 + v2 is odd, we see that uw and v must have distinct 

parities. 
Finally, direct substitution on the original equation shows that all 3-tuples as 
above are indeed solutions of Pythagoras’ equation, and there is nothing left to do. 
oO 


A 3-tuple (x, y, z) of positive integers such that x7 + y* = z? is said to be 


a pythagorean triple, in allusion to the Greek mathematician and philosopher 
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Pythagoras of Samos and his famous theorem on right triangles. Indeed, and as 
we have already mentioned, such a triple (x, y, z) determines,” up to congruence, a 
single right triangle of legs of lengths x and y and hypotenuse of length z. 

The coming example uses the previous result to find all integer solutions of 
another diophantine equation. 


Example 7.2 Find all nonzero integer solutions of the equation x7 + y* = 2z7, with 
x Atty. 

Solution In any such solution, x and y must be both even or both odd, for otherwise 
x*-+y* would be odd, hence different from 2z?. Thus, taking a = = and b = *>*, 
we have a, b € Z\ {0} (since x A ty) andx =a+b, y =a —b. Substituting such 
expressions for x and y in the original equation, we conclude that 


e+y=a2? oa +h = 2’, 
Since this last equation is that of Pythagoras, our previous result guarantees the 
existence of nonzero integers d, u and v, with u and v relatively prime and of distinct 
parities, such that 
a=2uvd, b=(u?—v?)d, z= (uw? +0°)d 


or 


a=(u?—v*)d, b=2uvd, z= (u? + v°)d. 


Hence, the solutions (x, y, z) of the given equation, with x ~ +y, are of one of 
the forms below, where d, u and v satisfy the conditions prescribed above: 


x= (WW —v*4+2uv)d, y= (—w +0? +2uv)d, z= (w+ v*)d 
or 
ee (u2 sag oh 2uv)d, y= (u2 SS ae 2uv)d, 7= (u? + v*)d. 


oO 


The equations we have analysed up to this point are, in a certain sense, privileged, 
for they possess infinitely many nonzero integer solutions. Our next example will 
display a diophantine equation in x, y and z having x = y = z = O as its 
only integer solution. The proof we shall present is an application of Fermat’s 
descent method, which can sometimes be profitably used to show that a diophantine 
equation does not have nonzero integer solutions. 


>For two different proofs of this fact, see chapters 4 and 5 of [9], for instance. 
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Schematically, Fermat’s method consists of the fulfillment of the following 
stages: 


i Assume that the given equation has a solution in nonzero integers. 
ii. From such an assumption, infer that it has a solution in nonzero integers which, 
in a certain sense (which will depend on the problem at hand) is minimal. 
iii. Deduce the existence of yet another solution in nonzero integers which is 
smaller than the minimal one of item ii., thus arriving at a contradiction. 


Example 7.3 Prove that the equation 3x? + y” = 2z7 does not have nonzero integer 
solutions. 


Proof Firstly, observe that we cannot have exactly one of the integers x, y, z equal to 
0. Hence, assume that the given equation possess a solution (x, y, z) with x, y,z € 
N (for, if (x, y, z) is a solution, then so is (+x, ty, +z)). Among all such solutions 
(x, y, z), there must exist one for which z has the least possible value, say x = a, 
y = b,z=c. Let us work this solution. 

If 3 { b, it follows from 3a” + b* = 2c? that 3 { c. But then, Corollary 6.8 
guarantees that b* and c? leave remainder | upon division by 3, and hence equality 
3a2 + b? = 2c? gives us a contradiction. Therefore, 3 | b, say b = 3b, for some 
b, € N, and it follows from 


2c? = 3a* + b? = 3(a’ + 3b7) 
that 3 | c. Letting c = 3c,, for some c; € N, the last equality above furnishes 
6c? = a? + 3b, 


so that 3 | a, say a = 3a, with a; € N. Now, plugging a = 3a, in the last equality 
above, we get 


2c? = 3a? + bi, 


so that (a), b;, c,) is another solution, in natural numbers, of the original equation. 
However, the relation 0 < cy = 5 < c is a contradiction, for we departed from 
a solution in natural numbers (a, b, c) for which the (natural) value c of z was the 
least possible one. 

Since our original assumption (namely, that the given equation has some solution) 
leads to a contradiction, we are forced to conclude that the equation has no solution 
at all. oO 


Back to Pythagoras’ equation, a natural generalization of it, posed by Fermat, 
would be to study the more general diophantine equation below: 


xt y= 2", (7.3) 
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Fermat himself claimed to have succeeded in using his descent method to prove 
that (7.3) has no nonzero integer solutions when n > 2, and for this reason such 
a result has eventually come to be known as Fermat’s last theorem. Nevertheless, 
since his reasoning was never written down, some of the most brilliant minds of 
Mathematics tried to reproduce it, year after year. As astonishing as it may seen, 
apart from particular cases, all such attempts resulted unsuccessful, and this state of 
things forged the general perception that Fermat had probably made some mistake. 

It took mankind almost 350 years to settle the issue. Indeed, it was only in the 
last decade of the twentieth century that the English mathematician Andrew Wiles 
entered History as the one to establish Fermat’s conjecture. Wiles accomplishment 
came after several years of hard work and through two long and deep papers, the 
second of which in collaboration with his former student Richard Taylor. As is 
almost always the case with such profound results in modern Mathematics, Wiles 
and Taylor built on equally deep works of other prominent mathematicians, among 
the most notable of them we should mention the Japanese Yutaka Taniyama and 
Goro Shimura, and the German Gerhard Frey.* 

Surprisingly enough, the descent method can be used to successfully approach 
the case 4 | n in Fermat’s equation (7.3). We do this next. 


Example 7.4 If the natural number 7 is a multiple of 4, then there does not exist 
nonzero integers x, y and z such that x” + y” = 2”. 


Proof Letn = 4k, with k € N. From x” + y” = z”, with x, y, z € Z \ {0}, we get 
8 + OT = 7’, 
so that (x*, y*, z**) is a solution of equation a*+b* = c? in nonzero integers. Thus, 
it suffices to show that this last equation does not admit nonzero integer solutions. 
By the sake of contradiction, let us assume that there exist a, b, c € N such that 
a* + b* = c*. We can also suppose that a, b and c are chosen in such a way that 
there is no other solution (a, 6, y), with a, 8B, y € N andy < c (as the reader 
surely expects, this is precisely the assumption that will give us a contradiction via 
the descent method). 
Since (a, b*, c) solves Pythagoras’ equation, Proposition 7.1, together with the 
minimality of c, guarantees that gcd(a?, b”) = 1, as well as that there exist relatively 
prime natural numbers wu and v, of distinct parities and such that 


a? =u*—v*, b? = 2uv, c=ure+v* 
or 


2 


a? =2uv, b? =u? — v’, cau +". 


3For the reader interested in knowing a little more on this epic endeavor, we recommend first to run 
through the best seller of Singh [35], and then to turn to the marvelous book of professors Stewart 
and Tall [37]. 
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Let us look at the first case above, the other one being totally analogous. In 
that case, a is odd and a? + v? = u?. Therefore, by invoking once more the 
characterization of pythagorean triples (and the fact that gcd(u, v) = 1) we conclude 
by the existence of relatively prime natural numbers p and q, of distinct parities and 
such that 


a=p’—@q’, v=2pq, u=p’+q’. 
Then, 
b? = 2uv = 4pq(p? +4”), 


and since gcd(p, q) = 1, it follows that both p and g are also relatively prime with 
p* +q?. Hence, in order that 4pq(p* + q7) is a perfect square, the only possibility 
is that all of p, g and p* + q? are also perfect squares, say 


p=a’, g=f’, pot+q’=y’, 


with a, 8, y € N. In turn, this finally gives 


of + pt = pt _?=y?, 


with 
c=W4vsu=ptig=y>y, 
thus contradicting the minimality of c. 
Therefore, there are no nonzero solutions of x” + y” = z” when 4 | n. Oo 


The next section shall bring a nontrivial (and important) application of Fermat’s 
descent. For the time being, let us close this section with a beautiful application of 
the characterization of pythagorean triples to a problem in Euclidean Geometry. 


Example 7.5 (Crux—Adapted) Given a positive integer r, we want to compute 
the number of pairwise incongruent right triangles ABC, satisfying the following 
conditions: 


(a) The radius of the circle inscribed in ABC equals r. 
(b) The lengths of the sides of ABC are relatively prime integers. 


Show that there are exactly 2! such triangles, where / is the number of distinct prime 
factors of r. 


Proof Let b and c be the legs and a be the hypotenuse of a right triangle whose 
inscribed circle has radius r. It is an easy exercise of Euclidean Geometry (which 
we leave to the reader) to show that r = aaa 
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Since a, b and c are relatively prime and b? + c? = a’, the characterization of 
pythagorean triples gives relatively prime integers u > v > 0, of distinct parities, 
for which 


b=u?—v’, c= 2uv, a=ur+v*. 
Thus, to count the number of triangles satisfying the given conditions is the same 


as to count the number of ordered paired (u, v) satisfying the above conditions. To 
this end, note that 


and that gcd(u, v) = 1 & ged(v,u —v) = 1. 

Letting r = p}'p;*--- p;', with p) < p2 <--+ < py primes, we conclude that 

there are exactly 2! distinct possibilities for v: 

Qj Qj 
v=1orv=p,'---p,*, 

where {i,,..., i} is a subset of {1, 2, ..., 7}. For each such v, note that u = v + c 
is completely determined and that u = v + 5 > v. We now have to check that u and 
uv have distinct parities. To this end, assume, without loss of generality, that r is even 
(the analysis of the case of an odd r is quite similar). Then, v contains either all or 
none of the factors 2 of r. If v has all factors 2 of r, then c is odd, and it follows 
that u = v + © is also odd; if v is itself odd, then 5 is even and u = v + © is odd. 
In any event, u and v have, indeed, distinct parities. 

In order to conclude that the number of right triangles we wish to count 
is precisely 2! it now suffices to show that, if (uy, v1) and (uz, v2) are two 
distinct choices for (u, v) as above, then the corresponding triangles are actually 
incongruent. To this end, assuming that vj # v2 (the case u; ¥ uz being totally 
analogous), we are going to show that the two corresponding right triangles have 
distinct hypotenuses. But this is straightforward: 


2 2 
r r 
we tu =upt+us #(H+u) +f=(L+u) + v3 
1 v7 


ees 2 
& ~—4+207 = ~ +20 
2 1 2 2 
vy vy 
2 2 
v5 —v 
2{ 2 1 2 2 
or ( oD) )=208 0 
ULV, 
- 
4 = v2, 
U1 U2 


which is an absurd. oO 
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1. (Brazil—adapted) Given n, k € N, prove that the equation x” + ky” = 2"! 
has infinitely many solutions in positive integers x, y and z. 
2. Find all solutions, in positive integers, of the equation 


(xtyP?t+z2 = +2). 


3. Show that the nonzero integer solutions of the equation x* + 2y? = z* are 


given by 


x =+(u* —2v7)d, y= +2uvd, z= +(u? + 2v*)d, 


where d, u and v are nonzero integers, with gcd(u, 2v) = 1. 
4. Show that none of the following equations have nonzero integer solutions: 


(a) x4 + 4y* = 2”. 
(b) x4 + 2y4 = 2?. 
(e).a7 4-9? = 37", 
(d) x3 + 5y? = 923, 
5. Do the following items: 


(a) (Hungary) Show that there do not exist x, y € Q such that x*+xy+y? = 2. 
(b) Find all x, y € Q for which x7 + xy + y? =1. 


6. Prove the following theorem of Euler: there does not exist nonzero integers w, 
x, y and z for which w? + x? + y? = 722. 
7. (Bulgaria) Prove that there does not exist rationals x, y and z such that 


e+ yr? 47743(x+ytz4+5=0. 


8. Given* a,b,c € Q such that a + bV2+ cJ/4 0, show that there exist 
x,y,z € Q for which 


1 
a+ bA/2 + c/4 
9. (IMO) Given n € N and a circle of radius 1, show that we can choose n points 


Aj, A2,..., An on it such that A; A; is rational, for any | <i < j <n. 
10. (IMO) Let a, b, c and d be integers such thata > b >c>d> 0. If 


=x+yV24+ 274, 


ac+bd=(b+d+a-—c)(b+d-—a+o), 


prove that ab + cd is not a prime number. 


For another approach to this problem, see Problem 9, page 486. 
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7.2 Pell’s Equation 


Recall (cf. Problem 6, page 190) that an integer greater than | is said to be square 
free if it is not divisible by the square of a prime number. In this section, we examine 
the integer solutions of equations of the form 


x? —dy* =m, (7.4) 


where d > | is square free and m is a given integer. Such an equation is known as 
Pell’s equation. 

Since d is a product of distinct primes, Example 6.24 guarantees that /d is 
irrational. In particular, we conclude that the above equation has no nonzero integer 
solutions when m = 0. Indeed, if this was not the case, then we would have x, y 4 0 
and, hence, /d = = € Q. On the other hand, if d,m < 0, then Pell’s equation 
obviously has no solutions, whereas if d < 0 < m, then (7.4) has at most a finite 
number of solutions. We also notice that, even if d,m > 0, (7.4) may have no 
solution at all, as Problem 1 shows. 

From now on, we shall assume that m = 1. (The general case is partially dealt 
with at Problem 4; in this respect, see also [27]). Before we dive into a general 
analysis, let us give a simple example showing that, in this case, Pell’s equation 
may have infinitely many solutions. 


Example 7.6 Show that the equation x? — 2y* = 1 has infinitely many positive 
integer solutions. 


Proof Note that x = 3, y = 2 is a solution. On the other hand, we can generate 
infinitely many integer solutions of this equation from one nonzero integer solution 
(a, b) in the following way: starting from a* — 2b? = 1, we write 
(a+ bV2)(a — bV2) = 1 
and, hence, 
(a+ bV2)° (a — bV2)* = 1. 

Expanding both binomials, we arrive at 

(a? + 2b* + 2abV2)(a* + 2b? — 2abV2) = 1 
or, which is the same, at 


(a? + 2b7)* — 2(2ab)* = 1. 


5 After John Pell, English mathematician of the seventeenth century. 
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Therefore, (a2 + 2b”, 2ab) is also an integer solution and, letting a, b € N, we have 
a < a~ + 2b’. Iterating the above argument, we obtain infinitely many solutions for 
the given equation. oO 


The method used in the previous example can be easily generalized to show that 
the general Pell’s equation x? — dy* = 1 admits infinitely many nonzero integer 
solutions, provided it admits at least one such solution. Moreover, as illustrated by 
Problems 4 and 7, for instance, small modifications in that approach also allow us 
to treat more general equations. 

Even tough we can easily find infinitely many integer solutions of (7.4) in some 
particular cases, for the time being we do not know whether or not there are other 
integer solutions, not to say how to find all of them. In order to answer these 
questions, we shall need a preliminary result, due to Dirichlet, on approximation of 
irrational numbers by rational ones. For what comes next, recall (cf. the discussion 
preceding Problem 15, page 178) that, for x € R, the fractionary part of x is the 
real number {x} € [0, 1), given by {x} = x — |x]. 


Lemma 7.7 (Dirichlet) Jf « is an irrational number, then there exist infinitely 
many rationals a with x and y nonzero, relatively prime integers such that 


1 
= (7.5) 


A 


Proof Letn > 1 be any integer and consider the n + | real numbers {ja} € [0, 1), 
with j = 0,1,...,n. Since 


[0D] 0, — PU — PU see ,1), 
n n'n n 


a union of n disjoint half-open intervals, pigeonhole’s principle guarantees the 
existence of indices 0 < k < j <n such that {ja} and {ka} belong to a single 
of these n half-open intervals. 


Then, |{ ja} — {ka}| < 4 or, which is the same, 


1 
IG — Ka — (Ljo] — lkal)| <—. 


It thus follows that 


Lja] — [ka i 1 
< < ; 
f=% (j—k)n ~ (j —k)? 


(7.6) 


and letting x = | ja] — [ka] and y = j —k, we have O < y <n and 
On the other hand, if d = gcd(x, y) and x = dx1, y = dy, then 


x 
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Xx] 1 1 
aj)< zS >> 
y1 y Jy 


and we can assume that gcd(x, y) = 1. 
In order to guarantee the existence of infinitely many such pairs, let x and y be 


nonzero, relatively prime integers such that F - o| < a By choosing a natural 


number 7, for which = - a > a and repeating the argument that led to (7.6) 


with n; in place of n, we obtain nonzero, relatively prime integers x; and y; such 
that 0 < y; <n, and 


X{ 1 1 
a| < a 
a mY yy 
Moreover, since 
bal 1 1 x 
a| < < < al, 
yi ny n\ y 


we conclude that [ 4 *. 
Finally, by iterating the above reasoning we obtain infinitely many (x, y) of 
nonzero, relatively prime integers satisfying (7.5). oO 


As we shall now see, Dirichlet’s lemma allows us to show that, for a fixed square 
free natural number d > 1, there exists at least one integer value of m for which (7.4) 
admits infinitely many integer solutions. 


Lemma 7.8 [fd > 1 is a square free natural number, then there exists m € Z \ {0} 
such that equation x* — dy” = m has infinitely many integer solutions. 


Proof Since /d ¢ Q, Dirichlet’s lemma assures that the set S of ordered pairs 
(x, y) of nonzero, relatively prime integers x, y satisfying = vd | < 2 is infinite. 


1 
lyl’ 


However, if (x, y) is such a pair, then |x — yVdl < 
gives us 


and triangle inequality 


1 
Ix? — dy?| = |x — yVdllx + yVdl < a (Ix men Fie 2\ylvd) 


1/1 
= 5 (4 +2014) Sn d 1: 


Thus, for each (x, y) € S, the set of integers x” — dy? is contained in the set of 
the nonzero integers of the interval ( _ (2/d +1), Qld + 1). But since this is a 
finite set, there exists an integer m 4 0, situated between ~(2/d +1) and Q/d+ 1, 
that repeats itself an infinitely number of times among the values of x? — dy?, for 
(x, y) € S. Of course, this is the same as saying that, for this value of m, equation 
x* — dy* = m admits infinitely many integer solutions. oO 
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We are finally in position to characterize all integer solutions of (7.4) when m = 
1; the key ingredients will be the previous lemma and Fermat’s method of descent. 
We start by establishing the existence of solutions. 


Proposition 7.9 [fd > 1 is a square free natural number, then equation x” —dy* = 
1 admits at least one solution in positive integers x and y. 


Proof Take, by the previous lemma, m € Z \ {0} for which the equation x” —dy* = 
m has infinitely many integer solutions. Since there are only finitely many different 
remainders upon division by m, we can choose two such solution, say (x1, 1) and 
(x2, y2), such that |x,| ~ |x2| and m divides x2 — x1, y2 — y,. Then, 

(x1 + yiV'd) (x2 — yoV'd) = (e122 — dyiy2) + G2y1 — x1y2)Vd (7.7) 


and, by writing x. — x; = mr and y2 — yj = ms, withr, s € Z, we get 


xx. — dyiy2 = x1(m2 — x1) + (2? — dy?) + Oi — dy 
= m(rx; + 1—sdy,) 


and 
X2¥1 — X1y2 = (x2 — x1) 1 + X101 — y2) = m(r —S). 


For the sake of simplicity, let xjx2 — dy, y2 = mu and xy, — x; y2 = mv, with 
u,v € Z. It follows from (7.7) that 


(a1 + yrVd) (x2 — yrv'd) = m(u + vv‘) (7.8) 
and, hence, 
(a1 — Vd) (a2 + yrV'd) = mu — vv‘). 
By multiplying the left and right hand sides of these two relations, we arrive at 
m* = (x? — dy?)(x3 — dy3) = m?(u* — dv”), 
so that u2 — dv? = 1. 
We are left to showing that u, v 4 0. If w = 0, we would have —dv” = 1, which 


is clearly impossible. If v = 0 we would get u = +1, and it would follow from (7.8) 
that 


(x1 + y1Vd) (x2 — yoVd) = tm = +(x5, — dyd); 


therefore, 
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xp t+ yd = £(x2 + y2V 4d), 


which in turn implies |x;| = |x2|, for /d ¢ Q. But this conclusion contradicts our 
choices of x; and x2, and we are done. oO 


We are finally able to state and prove the result we have been seeking, on the 
characterization of all integer solutions of Pell’s equation x* —dy* = 1, for a square 
free integer d > 1. Observe that the previous proposition guarantees that it has at 
least one solution. 


Theorem 7.10 [fd > 1 is a square free integer, then the equation x? — dy? = | 
admits infinitely many solutions in positive integers x, y. More precisely, if x = x1, 
y = yy is the solution in positive integers for which the sum x + yd is as small 
as possible, then the other positive integer solutions of the equation are given by the 
natural numbers Xn, yn Satisfying the equality 


Xn + ynVd = (x1 + iva)", 


wheren €N. 


Proof Leta =x, + yivd, with x;, yj € N being chosen as in the statement of the 
theorem. 
Given n € N, the binomial expansion formula readily gives xn, y, € N such that 


(x1 SE yd)" =X, yaw d. 


Thus, 


1 = (aj —dyz)" = (a1 + yd)" — yd)" 
= (Xp + ynVd)(Xn — yn) = - - dy?, 
so that all of the ordered pairs (xn, y,) built as in the statement of the theorem are 
indeed solutions of the given equation. 
Now, if (x, y) is any solution in positive integers, it suffices to show that there 
exists n € N for which x + yVd = a”. For the sake of contradiction, assume that 


the converse is true. Since a > 1, we have limy_, +40 a” = +00, so that there exists 
n € N for which 


a” <x+yVd <q"! 
or, which is the same, 


x+yJd 
< ——"_ <a 


a” 


1 (7.9) 
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However, relations a” = x, + ynvd and Fa —d ye = 1, would give us 


xtyVd — xtyvd 


a” Xn + ynvd 
= (Xn — ynVd)(x + yd) 


= (ain — dyVn) + ny — Ynx)Vd, 
with 
(xxn — dyn)” — dOny — yn)? = Oy — dyn) OP = dy*) = 1. 
Hence, (xxXn — dyyn, XnY — YnX) is an integer solution such that (by (7.9)) 


x+tyVd 
1 < (xx — dyyn) + (Xny — ynx)Vd = —_ =e 


Thus, if we show that xx, —dyyy, X,Y —Ynx > 0, we would arrive at a contradiction 
to the minimality of a (note that, here, we are invoking Fermat’s descent method). 
For what is left to do, letting a = xx, — dyy, and b = x,y — y,x, we have 


d 
yg =O i and a2 — db? = 1, 
a 


so that 


1 
a — bV/d = ———— > 0. 
a+bJ/d 


Then, on the one hand, 
2a = (a — bVd) + (a + bVd) > O; 


on the other, 


1 
a — bVd = ——— <1 
a+bJd 


gives us b/d > a — 1 > 0, so that b > 0. oO 
Example 7.11 Find all positive integer solutions of equation x7 — 2y? = 1. 


Solution Letting (x1, y,) be the solution in positive integers for which x; + y1 J2is 
as small as possible, the previous result teaches us that the positive integer solutions 
of this equation are of the form (x7, yy), where x, and y, are such that 


Xn + ynv2 = (x, + yiv2)", 


forn EN. 
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Since the ordered pairs (1, 1), (1, 2), (2, 1), (2, 2), (3, 1), (1, 3), (2, 3) and (4, 1) 


are not solutions of the given equation but (3, 2) is a solution, it is immediate to 
check that x; = 3 and yj = 2. Hence, the remaining solutions in positive integers 
are the ordered pairs (xn, yn) given by the equality 


an + ynV2 = (3 +272)". 


Problems: Sect. 7.2 


1. 


2. 


io’) 


* If d and m leave remainder 3 upon division by 4, show that (7.4) does not 
possess any integer solutions. 

With respect to Example 7.11, prove that the positive integer solutions (xy, yn) 
of x* — 2y? = | are recursively defined by x; = 3, yj = 2 and, for an integer 
n>1, 


Xn+1 = 3xXn +4yn, and yn41 = 2Xn + 3yp. 


. Prove that the equation x7 — 2y* = —1 has infinitely many integer solutions. 
.* Letd,m € N, with d > 1 being square free. If equation x* — dy? = m 


has a solution (xo, yo) in positive integers, prove that it has infinitely many such 
solutions. 


. (Hungary) Show that there are infinitely many positive integers n for which n* + 


(n + 1)? is a perfect square. 


. (Turkey) Find infinitely many solutions, in positive integers, of the equation 


ytl=xa+y). 


. Generalize the result of the previous problem in the following way: leta, b,c € Z 


be such that A = b* — 4ac is greater than | and square free. If n € Z is such that 
the equation 


a Ay? = 4an 


has at least one integer solution (x9, yo) for which 2a | (xo — byo), show that the 
equation 


ax” + bxy + cy” =n 


has infinitely many integer solutions. 


Chapter 8 m®) 
Arithmetic Functions hook for 


This short chapter introduces an important class of functions, called arithmetic 
multiplicative functions, which play a prominent role in the elementary theory of 
numbers. Among the many arithmetic multiplicative functions we shall encounter 
here, two deserve all spotlights: the Euler function gy, which will reveal itself to 
be an indispensable tool for basically all further theoretical developments, and the 
Mobius function jz, which is essential to getting the celebrated Mébius inversion 
formula and its subsequent application to the Euler function. 

In all that follows, we shall refer ourselves to a given function f : N > Ras an 
arithmetic function. 


Definition 8.1 An arithmetic function f : N — R is said to be multiplicative if, 
for all relatively prime m,n € N, one has f(mn) = f(m) f (n). 


Since 1 is relatively prime with itself, if f : N — R is an arithmetic 
multiplicative function, then f(1) = fd)’, so that f(1) = Oor 1. Incase f(1) = 0, 
we get 


fa@)=fa-D=f@fM =90, VneN, 


i.e., f vanishes identically. Hence, from now on we shall implicitly assume that if 
f :N— Ris an arithmetic multiplicative function, then f(1) = 1. 

Note also that if f : N > R is an arithmetic multiplicative function andn > | is 
an integer with canonical decomposition n = oe ae ps then several applications 
of the definition give us 


fn) = f (pt). Fe). (8.1) 


In words, the above inequality says that, in order to compute the values f(n), with 
n €N, it suffices to know how to compute the values f(p%), with p prime and 
aeéeN. 
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Finally, if f : N — R is an arithmetic function such that f(1) = 1, then the 
equality f(mn) = f(m)f(n) is always true when m = | orn = 1; therefore, in 
order to prove that such an f is multiplicative, we just have to consider the case of 
m,n> 1. 

We shall repeatedly use the above remarks without further notice. 


Example 8.2 We saw at Corollary 6.46 that ifn = oe bbe Pp is the canonical 
decomposition of a natural number 1 > | in primes, then 


d(n) = (a, +1)... (ae +1) 


is the number of positive divisors of n. We claim that the function d : N > R thus 


obtained is multiplicative. Indeed, if m,n > 1 are relatively prime integers with 


canonical decompositions n = pf! ... pp and m = qh eae then p; 4 qj; for 


all 1 <i <k,1< j </. Therefore, the canonical decomposition of mn in primes 
is 

mmi=p," wpe ede 
so that 


d(mn) = (a, +1)... (ax + D(B1 +1)... (8) + 2) = d(m)d(n). 


For what comes next, given n € N we shall write D(n) to denote the set of 
positive divisors of n, so that d(n) = |D(n)|. 
Lemma 8.3 [fm and n are relatively prime naturals, then the map 
f : Dam) x D(n) — D(mn) 
(x, y) > xy 
is a bijection. 


Proof It follows from Example 8.2 and the fundamental principle of counting that 
|D(mn)| = d(mn) = d(m)d(n) = |D(m)| -|D(@)| = |D(m) x Din)|. 


Hence, the domain and codomain of f have equal quantities of elements. Thus, in 
order to prove that f is a bijection, it suffices to establish its surjectivity. To this end, 
let us apply item (d) of Proposition 6.22: if k | mn, then the condition gcd(m, n) = 1 
guarantees that 


k = gcd(k, mn) = gcd(k, m) - gcd(k, n). 
Therefore, if we let a = gcd(k, m) and b = gcd(k, n), we geta € D(m),b € D(n) 


and f((a, b)) = ab = k, so that k belongs to the image of f. But since k € D(mn) 
was arbitrarily chosen, we conclude that f is surjective. oO 
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Along the rest of this chapter, we shall write )°o_ din J (d) to denote the sum of 
the values f(d), when d varies in D(n). Although such an use of the letter d is 
conflictant with the notation for the function d of the previous example, this will be 
harmless in context. 

The coming proposition establishes one of the most important properties of 
arithmetic multiplicative functions. 


Proposition 8.4 [f f : N — R is an arithmetic multiplicative function, then so is 
the function F : N > R, given by 


F(n)= D> f@. 


O<d|n 


Proof If m,n € N are such that gcd(m, n) = 1, then the previous lemma, together 
with the multiplicative character of f, gives 


Fim) = DY) f@= YO fadid)= Yo YO fadf@) 


O<d|mn O<d|m 0<d|m 0<d2|n 
O<d)|n 
=| >> fay) | > f@ 
O<d,|m 0<d2\|n 
= F(m)F(n). 


oO 


Example 8.5 The function f : N — R such that f(n) = n for every n € N 
is obviously multiplicative. Hence, the previous proposition assures that so is the 
function s : N > R, given by 


sn)= D> f= Yoda. 


O<d|n O<d|n 


Since s(n) adds the positive divisors of n, given a prime number p anda €N, it 
follows from Corollary 6.46 that 


ba ; pr —1 
= Do d= oe = 
i=0 


O0<d|p” 
Thus, ifn = pj'...p;*, with primes py < --- < px and natural numbers 
Q1,..., Ax, relation (8.1) and the above computations give 
j+1 
k ie k py i 
s(n) = [[s@; = I] =. ; (8.2) 
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For our next example, we recall (cf. Problem 6, page 190) that an integer n > 1 
is square free ifn = p,... pg, with p) < --- < px being primes; equivalently, n is 
square free if there does not exist an integer g > 1 such that q? | n. 

The coming definition presents one of the most important arithmetic multiplica- 
tive functions multiplicativas, the Mobius function. 


Definition 8.6 The Mobius function! is the function 1. : N > R given by 


1, ifn=1 
u(n) = 4 0, if g?|n, paraalgum inteiro q > 1 
(=1)*, ifn = pj... Pe, COM pj,..., Pe distinct primes 


In order to check that the Mébius function is indeed multiplicative, let m,n > 1 
be relatively prime integers. Then, mn will be divisible by a perfect square greater 
than | if and only if either m or n do satisfy such a property; this being the case, it 
is immediate to see that 


f(mn) =0= f(m) f(n). 


On the other hand, if m and n are square free, say n = py...px andm = 
qi---q, With py <--- < pgrandq, <--- < q being prime numbers, the condition 
gcd(m, n) = | assures that p; ¢ q; for alli and j. Hence, mn = p,... peqi..- qi 
is the canonical decomposition of mn in primes, so that 


umn) = (—- 1)! = (-1)*(-1)! = we) ue). 


The coming result brings a very important property of the Mobius function. 


Proposition 8.7 [f 1: N — R is the Mébius function, then 
1, ifn=1 
dj=;. : 
2d, Hd) i ifn>1 
O0<d|n 


Proof Let F : N > R be the function given by 


Fin)= DY) ud). 


O<d|n 


Proposition 8.4 guarantees that F is multiplicative, and we wish to show that F(1) = 
1 and F(n) = 0 forn > 1. We consider three cases separately: 


(i) n = 1: we have F(1) = Vo«<d|1 uw(d) = wl) = 1. 


' After the German mathematician of the nineteenth century August Mobius. 
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(ii) n = p*, with p prime and k > 1 integer: then 


k 
F(p')= Yo w@ =>) up’) = w) + wp) = 0. 


0<d| p* j=0 


(ii) n = Br ban Pes with p; <--- < px being primes: since F is multiplicative, 
it follows from (8.1) and item (ii) that 


F(n) = FG, Vee GE) = 0. 
Oo 


Theorem 8.8 is the reason behind the central role of Mobius function in the theory 
of arithmetic multiplicative functions. For its proof, notice that the function 


f : Din) > D(n) eu) 
d +> n/d 
is a bijection; indeed, this follows at once from the fact that f o f = Idpyn), as the 
reader can promptly check. 
In the notations of Proposition 8.4, formula (8.4) below, known as the Mobius 
inversion formula, teaches us how to recover function f from F (even if f is not 
multiplicative). 


Theorem 8.8 (Mobius) Let f : N > R be an arithmetic function (not necessarily 
multiplicative). If F : N — R is given by F(n) = ae f (@), then 


fin) = Yo F(S)n@= YS F@u(5). (8.4) 


O<d|n O<d|n 


Proof The second equality in (8.4) follows from the bijectivity of the function 
in (8.3). For the first equality, note first that 


Ds F (5) u@ = Yi te@ YS £f@))= >> YS ue@se@’. 
O<d|n O<d|n O0<d'|5 O0<d|n 0<d'| 5 


However, since d | n and d’ | 4 if and only if d’ | n and d | <,, it follows from the 
above computations that 


~FG)e@= SY w@s@= VY [v@ YL ww 


O<d|n 0<d'|n 0<d| 5, O<d'|n O<d| a 
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Now, if 7 > 1 (i.e., if d’ < 1), then Proposition 8.7 furnishes 


Y> pwd) =0. 


O<d| 4 


Therefore, the next to last sum above reduces to the summand corresponding to 
d' =n, so that 


Yo 1 f@) YS u@] =f) VO u@ = fu) = fo. 


O<d'|n O<d| ir O0<d|1 


oO 


Before we proceed with the development of the theory, we present an example 
which shows how Mobius inversion formula can be a useful tool in counting 
problems. 


Example 8.9 Let n € N be given. We say that a sequence (x1, %2,..., Xn), with 
x; € {0,1} for 1 < j <n, is aperiodic if there does not exist a divisor 0 < d < 
n of n such that the sequence is formed by the juxtaposition of 7 copies of the 
subsequence (x1,...,Xq). Compute, in terms of n, the total number of aperiodic 
sequences of n terms. 


Proof First of all, the fundamental principle of counting assures that there exist 
exactly 2” sequences (x1, X2,..., Xn) with x; € {0, l}forl <j <n. 

On the other hand, for such a sequence (x1, x2,..., Xn), we define its period as 
the least positive divisor d of n for which the sequence is formed by the juxtaposition 
of n/d copies of the subsequence (x1, ..., xq). In particular, an aperiodic sequence 
(x1, X2,...,Xn) has period n. 

More generally, if the sequence (x1, x2,..., Xn) has period d, then (x1,..., xa) 
is necessarily aperiodic, and conversely. Hence, if ax denotes the number of 
aperiodic sequences of k terms, we get 


> aq = 2". 


O<d|n 


If we now apply the Mobius inversion formula, we obtain 


as wished. oO 


The theory developed so far allows us to introduce and study the main properties 
of another quite important arithmetic function, according to the following 
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Definition 8.10 Euler’s g function is the function g : N > N given by 
p(n) = #{1 <k <n; ged(k,n) = 1}. 


In words, y(n) counts how many integers from | to n are relatively prime with 
n. In what comes next, among other properties we shall show that the function @ is 
multiplicative. We shall also use this result to compute g(7) in terms of the canonical 
decomposition of n in prime factors.* We start with a result which will be useful in 
other circumstances. 


Proposition 8.11 With respect to Euler’s function p : N > N, we have 


ea = Y o(S) =n. 


O<d|n O<d|n 


Proof The first equality comes from the bijectivity of function f in (8.3). For the 
second one, let D(n) = {1 = aq, < a <--- < qa =n}. If 1 < k <n, then 
gcd(k,n) € D(n), ie., gcd(k,n) = a; for some 1 <i < t. Therefore, letting 
Aj = {1 <k <n; gcd(k,n) = a;}, we get 


th, = A, U...UA;, 
a disjoint union. Hence, n = poe |A;|. Now, note that 
Aj ={1 <k <n; gced(k,n) =a;} 


= {1 <k/aj <n/aj; k/a; € N and ged (k/a;,n/aj;) = 1} 
= {1 <1 <n/a;; gced(/,n/a;) = 1}, 


so that |A;| = yg (4). Thus, 


O<d|n 


oO 


For the coming result, also due to Euler, we shall need the fact, left as an exercise 
to the reader (see Problem 1), that the product of two arithmetic multiplicative 
functions yields yet another such function. 


Theorem 8.12 (Euler) The Euler function g : N > N is multiplicative. 


? Another approach was the object of Problem 6, page 40. 
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Proof Letting G(n) = n, the previous proposition gives us )°y_ d\n P(A) = G(n). 
Therefore, it follows from Mobius inversion formula that 


n n (da) 
en) = Y> w@G(5)= Yo n@- Fan DR. (8.5) 
O0<d|n O<d|n O<d|n 
However, since f(d) = u@) is a multiplicative function (verify this assertion!), 


Proposition 8.4 shows that function F : N > R, defined by 


is also multiplicative. Hence, Problem | assures that g(n) = nF (n) is multiplica- 
tive, too. oO 


As we have promised before, the coming corollary relates g(n) with the 
canonical decomposition of an integer m > 1 in prime factors. 


Corollary 8.13 (Euler) Jf the canonical decomposition of an integer n > 1 in 
primes is given byn = py 9 Be then 


vin) =n(1-—)...(1-=). (8.6) 
Pi Pk 


Proof We shall firstly compute the value of p(p%), with p prime and aw > 1 integer: 


p(p*) =#{1 <k < p®; ged(k, p®) = 1} 
= #{1 <k < p*; gcd(k, p) = 1} 


SH 28s ep POP AO) 


Now, since ¢ is multiplicative, it follows from (8.1) that 


p(n) = v(py')...9(pe") 
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The formula of the previous corollary allows us to deduce many interesting (and 
useful) properties of the Euler function. The coming example presents such a result. 


Example 8.14 Given m € N, show that the equation y(n) = m has at most a finite 
number of solutions n € N. 


Proof Firstly, ifn = pr, with a > 1, p prime and gcd(p,r) = 1, then the 
multiplicative character of the function g gives 


m = 9(n) = o(P")9(r) = p*"(p — Yer) = 2°", 
for g(r) => 1 and p => 2. Hence, taking logarithms at base 2, we obtain 
a <1+ log, m. 
Moreover, analogous computations furnish 
m= p*'(p— gr) = pI, 


whence p< m-+1. 
Now, ifn = ot ae pe is the canonical decomposition of n in primes and a = 
max{a@1,..., a}, it follows from what we did above that a < 1+ log, n and, hence, 


1+logy m 
n<(pi...pe)* <( [] ») 


psm+l1 
p prime 


Problems: Chap. 8 


1. * Prove that the product of two arithmetic multiplicative functions is also an 
arithmetic multiplicative function. 
2. Prove that, for every n € N, we have TTo<an d=n 


Prove that ea > ./n, for every n EN. 

4. (OCM) A teacher chose a positive integer n and, then, posed the following 
problems to two of his students: the first one should compute the number of 
ordered pairs (x, y) of positive integers satisfying the equation i + t = ; the 
second should do the same regarding the equation i a 1 Knowing that 
the sum of the answers found by the students was 78, show that at least one of 
them made a mistake. 

5. (Hungary—adapted) For n € N and 0 <r < 3, let D,(n) be the set of positive 


divisors of n leaving remainder r upon division by 4. 


d(n)/2_ 


~ 
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(a) If gcd(m, n) = 1, prove that there exists a natural bijection 
(Di(m) x Di(n)) U (D3(m) x D3(n)) —> D\(mn). 


Do the same with respect to (D,(m) x D3(n)) U (D3(m) x Di(n)) and 
D3(mn). 
(b) Prove that |D;(n)| => |D3(n)|. 


. Anatural number n > | is said to be perfect if s() = 2n, where s(n) denotes 


the sum of the positive divisors of n. Prove that: 


(a) Ifn > 1 is perfect, then Vo<d\n 4 = 2. 
(b) If p is a prime number for which 2? — 1 is also prime, then 2?~!(2? — 1) 
is perfect. 


. The purpose of this problem is to establish the converse of the previous 


problem, thus proving the following theorem of Euler: if n is an even perfect 
number, then there exists a prime number p such that 2? — 1 is prime and 
n = 2?-1(2P — 1). To this end, let n = 2*q be an even perfect number, with 
k,q € N and q being odd. Do the following items: 


(a) Conclude that (2‘+! — 1)s(q) = 2'+!q. Then, show that there exists a € N 
for which g = (2+! — 1)a and s(q) = 2**1a. 

(b) If a = 1, show that 2‘+! — | is prime, whence k + 1 = p, a prime number. 

(c) Ifa = 2+! — 1, show that s(q) > 1+a+a’ > (a+ Da = 2**1a, which 
is a contradiction. 

(d) Ifa > Landa 4 2‘+!—1, theng has at least four distinct positive divisors: 
1, 2'+!_1, a and (24+! — 1)a. Use this fact to conclude that s(q) > 2*+!a, 
thus arriving at a new contradiction. 


ww 


. A natural number n is abundant if s(n) > 2n, where s(n) denotes the sum of 


the positive divisors of n. If a € N is abundant, show that ab is also abundant, 
regardless of the value of b € N. 


. Let f : N > R be defined by f(1) = 1 and, forn > 1, 


ely 
ff") = ———_,, 
P\P2.--+ Pk 
where p1, p2,.--, Px are the distinct prime divisors of n. Find all n € N for 


which 2 dean f(d) =0. 
If f : N > Ris an arithmetic multiplicative function, prove that 


y> ud) fd) = [] d- fo). 
O0<d|n ap 
pin 


Then, use the above result to establish the following items: 


3Up to this day, no one knows whether or not there exist odd perfect numbers. 
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11. 


12. 


13. 


14. 


15. 


16. 


(a) Doda du(d) = Tpsine (1 — Dp). 
pin 


(b) Yo d\n (dy? = 2* where k is the number of distinct prime divisors of n 
(note that k = Oifn = 1). 


Prove Liouville’s theorem‘: for each n € N, one has 
2 
(iam) = YO aw’. 
0<j|n O0<j|n 


* Let f : N — R be any function and F : N —> R be given by F(n) = 
Vo<a\n f (d). Prove that 


Yi F® = 
k=1 


Let f : N > {-1, 1} be given by f(1) = 1 and f(n) = (—1)"'*"'T, where 
n= p ree ioe is the canonical decomposition of an integer n > | in primes. 
Prove that, for every integer n > 1, one has 


y [5] ra =a. 
jail 


| 2] 70. 


j=l 


(Brazil) Prove that, for every natural number n > 1, we have 
n 
Be eae idk) < ee eet 
mo. Ss i EO NE aS n) 


Let F be an arithmetic multiplicative function and f : N —> R be implicitly 
defined by the equality F(n) = Vo<din f (d), for every n € N. Prove that f is 
also multiplicative. 

The purpose of this problem is to give another proof of the multiplicative 
character of Euler’s function g. To this end, given relatively prime integers 
m,n > 1, arrange all natural numbers from | to mn as shown in the table 


1 Qe ki... m 
m+ 1 m+2... m+k...2m 
2m+1 2m+2... 2n+k...3m 


Ce ee es ee 


and do the following items: 


4 After Joseph Liouville, French mathematician of the nineteenth century. As we shall see in 
Sect. 20.4—cf. Theorem 20.31—Liouville was the one to give the very first example of a 
transcendental number. 
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17. 


18. 


19. 


20. 


21. 
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(a 


Nar 


Prove that an entry of the table is relatively prime with mn if and only if it 
is relatively prime with both m and n. 

Show that, in any column, either all elements are prime with m or none of 
them satisfies such a property. 

Prove that there are exactly y(m) columns formed by integers relatively 
prime to m; moreover, each one of these columns contains precisely g(7) 
entries relatively prime with n. 

(d) Conclude that g(mn) = g(m)g(n). 


(b 


wm 


(c 


Ne 


If F(n) = Yoe din a. compute F’ (7) in terms of the canonical decomposition 
of n. 

(OIM shortlist) For each m € N, let A» be the set of ordered pairs (d,n) of 
integers such that d is a positive divisor of m, 1 <n < mand gcd(d,n) = 1. 
Find all m € N for which |A,,| = 1993. 

* For an integer n > 2, prove the following items: 


(a) If Pp = {1 < k <n; gced(k,n) = 1}, then the correspondence k bh n —k 
is a bijection of Py. 
(b) g(n) = 21, where / is the number of elements of P,, which are less than or 


equal to ut ; in particular, y(n) is even. 


Given m,n € N, withn > 2, let 1 = a, <--- < ay =n -— 1 be the positive 
integers prime with n and less than or equal to n, and S,,(n) = ee a." be the 
sum of their m-th powers. Establish the following: 


(a) Ifk = 21, then Sj,(n) = ae (a;” + (n — a;)""). Then, use this relation to 
conclude that S;, (7) is even whenever n is itself even. 

(b) Sm(n) = D_o(- LY! (Gan Sj(n). 

(c) If m is odd, then 2S,,(n) = n reg (-1) i ae S;(n). Then, show 
that n | S,,(n) if m is odd. 

In the notations of the statement of the previous problem, show that: 


a) Every 1 < m <ncan be uniquely written asm = 4-a, witha, d €¢ N such 
y quely d 


that d | n and gcd(a, d) = 1. 


Sed) _ 1k&42k4...4nk 
(b) ean dk nk 7 


(©) Sela) =H Soca (4) (“2 FE*“). 
(d) Si(n) = 5ny(n). 


(e) So(n) = 4n? y(n) + én Toren (1 — p). 
p\|n 


Chapter 9 m®) 
Calculus and Number Theory cen 


In this chapter, we assume that the reader is conversant with the rudiments 
of Calculus. More precisely, we shall assume from the reader familiarity with 
convergent sequences and series, as well as with the notions of limits and derivatives 
of functions. In this sense, the material of [8] covers all of what is necessary. 
Here, we shall present some basic examples and results on the distribution of prime 
numbers, having the classic book of professor G. Andrews [2] as guideline. We shall 
also discuss a rather interesting asymptotic result of Cesaro, on pairs of relatively 
prime natural numbers, following [29]. 


9.1 On the Distribution of Prime Numbers 


One of the first results on prime numbers we have learned in this book was Euclid’s 
theorem, which asserts that the set of primes is infinite. Nevertheless, as was already 
pointed out in Remark 6.39, there is an distinctive lack of uniformity on the way 
these numbers distribute along the naturals. It is, therefore, quite astonishing that a 
result such as Hadamard’s Prime Number Theorem, quoted below, actually holds 
true. Before we state it, let us set some notation: for each positive real x, we shall 
write (x) to denote the number of prime numbers less than or equal to x. 


Theorem 9.1 (Hadamard!) In the above notations, one has 


1 (x) 
lim —— =1, (9.1) 
x>+00 x/ log x 


where log : (0, +00) — R stands for the natural logarithm function. 


‘Jacques Hadamard, French mathematician of the nineteenth century. 
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Although a proof of the above theorem is well beyond the scope of these notes, 
we refer the interested reader to the marvelous book of professor T. Apostol [5] for 
a self-contained approach. However, we stress that Hadamard’s theorem assures that 
the functions z(x) and fay are asymptotically equal, in the sense that limit (9.1) 
holds true. 

A few years earlier than Hadamard, the Russian mathematician P. Chebyshev 
obtained a much simpler (yet rather interesting) result on prime distribution, 
guaranteeing the existence of positive constants c and C such that 


x 


<A@<sc : 
log x log x 


for every real x > 2. 

A proof of the second one of the above inequalities will be the object of 
Problem 4; for a proof of the first one, we recommend to the reader the book of 
professor Andrews, already quoted above. On the other hand, it readily follows from 
the second one of the above inequalities that 


W(x) 2 C 
x ~ logx 


for every real x > 2; in particular, one has 
lim —=0. (9.2) 


In words, (9.2) says that the total quantity of prime numbers less than or equal 
to x is an infinitesimal with respect to x (i.e., grows much slower than x itself) as 
x — +00. In what follows, we shall give a direct prove of the validity of such a 
result. We start with the following preliminary 


Proposition 9.2. [fk is a natural number, then 


U(x) Z y(k) =: 2k 
x k x 


’ 


where p denotes Euler’s function. 


Proof For a real number x > 0, let g,r € Z be such that |x] = kq +r, with 
0 <r <k. Note that 


q-1 
10 Be rey (EN) = (Ute + 1.47 +2....kG + DIU 


j=0 


O tk bl hg ey kG aE T], 


(9.3) 
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From | to k there is at most k prime numbers. On the other hand, if j > 1, then, 
for an element of {kj + 1,kj7 +2,...,k(j + 1)} to be prime, it must necessarily be 
relatively prime with k. Hence, such a set contains at most y(k) prime numbers. 

Now, since g = Fal , (9.3) allows us to write down the following upper estimate 
for 2(x): 


r(x) <k+ q— Dek) +r <2k+ |= |p) <2 + To. 


Finally, in order to get the stated result, it suffices to divide both sides of the last 
inequality above by x. Oo 


We shall also need the following technical 


Lemma 9.3 [fm > | is an integer and p, p2,..., Pn are the prime numbers less 
than or equal to m, then 


Erlaer-Cal 


Proof Formula (3.1) for the sum of a convergent geometric series gives 


O--a) aT -ea) 


On the other hand, the choice of p1, p2, .-., Pn, together with the fundamental 
theorem of arithmetic, guarantees the validity of the inclusion 


r 1 | T _ n> 0) 
Sess ——- — 3 JlsJ2.-+-5 Jn 2 : 
9 m PP, ep, 


Therefore, 
m n 
1 1 1 1 
mee 2D aoe (i+ eae), 
j=! Ais J2s5Jn20 P| Pz --+Pn i=1 i 
and the desired result promptly follows. oO 


The previous lemma, together with the divergence of the harmonic series (cf. 
Example 7.37 of [8], for instance), gives 


1 i 
lim [-=)---(-=)] ae 
n— +00 P\ Pn 


where pj stands for the j-th prime. Therefore, 
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1 1 
lim (1-—)...(1-—)=0 (9.4) 
n— +00 Pi Pn 


With the previous results at our disposal, we are finally in position to prove the 
promised weak version of Hadamard’s theorem. 


Theorem 9.4 limy_.+00 ze) = 0. 


Proof We have to prove that, given e > 0, there exists x9 > O such that x > 
xo > ms) < €. To this end, let pj, po, ..., Py be the first n prime numbers, and 
k = pi p2..- Pn. Proposition 9.2, together with the formula for g(k), gives us 


70 <(1 ~) (1 ~)...(1 —) 4 Piha Pe (9.5) 
x P1 P2 Pn x 


Now, (9.4) allows us to choose a natural number n such that 


(-YO-D-24 


If we let xo = ee then, for x > x9, it follows from (9.5) that 


m(x)  € 2p, pr---Pn — € | 2PiP2--- Pn 
<S <is=+ 
x 2 x 2: xo 


=€é, 


as wished. oO 


The arguments that led to (9.4) used the fact that the harmonic series diverges: 
Nes 1 1 = +o. Heuristically, we can say that this is due to the fact that the 
sequence of naturals has too many naturals (actually, all of them). On the other 
hand, already in Lemma 3.15 we used the fact that the series of the inverses of 
perfect squares converges: Laan 4. < +00. This time, and also from a heuristic 
point of view, we can attribute the convergence of the series to the fact that the 
sequence of perfect squares contains too few naturals. 

In view of the above, a natural question poses itself in the context of prime 
numbers: if p, stands for the n-th prime number, does the series eel x have 
too many or too few naturals, in the above sense? Theorem 9.4 would encourage us 
to say that it does possess too few naturals. 

At this point our intuition fails, for we shall prove below another theorem of the 
great L. Euler, asserting the divergence of the series of the inverses of primes. To 
this end, we shall need the inequality below, which readily follows from (3.7): 


e>1+x, Vx>0. (9.6) 


>For a slightly different proof of such a result, see the problems of Section 10.9 of [8]. 
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Theorem 9.5 (Euler) Jf px denotes the k-th prime, then the series et x 
diverges. 


Proof If m € N and pj, p2,..., pe are the prime numbers less than or equal to m, 
it is immediate to see that 


ralt+o DL —+}2xh 


I<j</m Isick lsf<n<jisk PAPA Phi 


Now, observe that the second factor at the left hand side of the previous inequality 
is nothing but the sum of the inverses of the square free natural numbers less than or 


equal to m. Letting Mie jem ; denote this sum, we can write the above inequality as 
jsf ° 


1 1 m 1 
(© aE a)2hF 
Isj<J/m ces j=l 


Since > psi 5 diverges and )* jel 7 converges, we conclude that 


diverges. Finally, by the sake of contradiction, assume that the series of the inverses 
of primes converges to a certain real number a. Then, for every n € N, we get 
from (9.6) that 


1 1 1 1 
c' > exp( y =)= I] exp(<) = I] (1+-)- 5 -, 
p<n P p<n P p<n P l<j<n J 
p prime p prime p prime j sf 


where exp : R — R denotes the exponential function of basis e. But this is indeed 
a contradiction, for we already know that the sum of the inverses of the square free 
natural numbers diverges. oO 


Problems: Sect. 9.1 


1. Prove that there exists a positive real x9 for which m(x) > log for x > xo. 


2. For each integer k > 1, let ax, be the k-th natural number which is not a perfect 
square. Decide whether the series }°,. ; x converges. 
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3. 


4. 


For each integer k > 1, let a, be the k-th composite natural number. Decide 
whether the series 7, , - converges. 
The purpose of this problem is to show that 


x(x) < (30 log 2) ar (9.7) 


for every real x > 8, where log : (0,-+oo) — R denotes the natural logarithm 
function. To this end, do the following items: 


2n 


) is divisible by all prime numbers p 


(a) For every n € N, prove that ( 


satisfying n < p < 2n. Moreover, show that ( 2. 
(b) Prove that, for a natural n > 2, one has 27(2n) < a(n) + (2 log 2) oan’ 


(c) If f : (0, +00) > Ris given by f(x) = ieee 
(e, oo) and that f (3) < 13 f(x) forx > 8. 
(d) Use items (b) and (c) to conclude that r(2n) < (321log2) 


integer n > 2. 
(e) Deduce that (9.7) holds for every real x > 8. 


show that f is increasing in 


n 
rear for every 


. The purpose of this problem is to establish the converse of Problem 17, page 191. 


To this end, given n € N, let /(n) denote the sum of the greatest odd divisors of 
the numbers 1, 2, ..., 7 and do the following items: 


(a 


wm 


For n € N, let t(n) be the greatest odd divisor of n and i(n) the greatest odd 
integer less than or equal to n. Prove that 
I(n) = (t(1) + 73) +--+ + TG) 
+r) +t) + +-s +t 2la/2)) 


= rin) + 1)? +1 [n/2]). 


(b 


we 


In the notations of (a), show that 


t 


1 
Tn) = 7) qe) + D*, 
k=0 


where 2' is the greatest power of 2 which is less than or equal to n and, for 


0 < k <t, q is the quotient one gets upon dividing n by 2". 
(c) Use the fact that i(g,) < aE to get the estimate 


1) <4 seg ae 4 : +t+1 


(d) Use the fact that t = |log,n]| <n to get the estimate 


wm 
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An? + 15n +3 


I(n)s D 


(e) Use inequalities [log, n| > 0, 2082") > glesan-l — > and 


n 
tq)2=a-1=|5|-12 5-2 
to show, in an analogous way, that 


4n? — 12n —1 


I(n) > 
(m2 12 
(ff) If Tm) =14+2+---+n, conclude that 


4n? —12n —1 “ I(n) Z An? + 15n +3 
6n2+6n ~ T(n)~  6n24+6n 


Then, show that ifr € Q \ {3}, then ms = r for at most a finite number of 
integer values of n. 


9.2 Chebyshev’s Theorem 


The Prime Number Theorem guarantees that the difference m(2x) — a(x) is 
asymptotically equal to 


2x x x ( 2 log x :) 


log(2x)  logx ~ log x \log 2 + log x 


_ # logx — log2 
~ logx \logx + log2/° 


However, since 
log x — log2 
m Cl 
x +00 log x + log2 


we conclude that 2(2x) — a(x) is asymptotically equal to —*—. It happens (for 


log x* 
instance by L’ H6pital’s rule) that 


lim 
x>+00 log x 


so that the above reasoning assures that the number of primes between x and 2x 
grows unboundedly as x > +00. 
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In this section we give a complete proof of a much weaker result, also due to 
Chebyshev, which shows that there is at least one prime number between n and 2n, 
for every integer > 1. For the proof of it, we shall need the two following auxiliary 
results. 


Lemma 9.6 Letn, p € N, with p prime, and let 1p be the exponent of p in the 
factorisation of C), Then, 


=X] eb) 


Moreover, if Vp is the only integer such that p’? < 2n < prt! then [kp < Vp. 


Proof For the first part, since Cc) = mie 
ep (2n) 


by pYe = an? where e,(2n) and e,(n) are as in Legendre’s formula (cf. 


we readily conclude that jz» is given 


Proposition 6.10). Hence, 
Lp = €p(2n) — 2ep(n), 


and it suffices to use (6.10) to get the stated formula for jp. 
For the second part, it follows from the definition of v, that 


. 2 
j>vp>pi>mn|%|-2)4]=0 
p! p! 


On the other hand, for 7 > 1 we have 


2n n 2n n 
- 2 -|<—-2 -—1])=2, 
p! p/ p! p! 


so that | 25 | —2 Fa < 1. Therefore, gathering together the two pieces of 


Bi 
information above, we obtain 


= EFL a)-EC Fla) eee 


j2i j= 


Lemma 9.7 If x > 2 is a real number, then 


I] p<. 


psx 
P prime 
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Proof The lemma is clearly true for 2 < x < 3. On the other hand, if it is valid for 
x =n, where n > 3 is an odd integer, then it will also be valid form < y <n +2; 
indeed, in such a case n + | is even, so that 


I] p= I] p<4° <4. 


Psy psn 
p prime Pp prime 


It then suffices to show that the lemma is true for x = n, where n > 3 is an odd 
integer. To this end, we make induction on n > 3 odd, noticing that we already have 
the validity of the result for n = 3. By induction hypothesis, assume that it is also 
true for all odd integers less than a certain odd integer n > 5. Letk = ne with the 
sign chosen in such a way that k is odd. Then, k > 3 and n — k is even and such that 


n-k=2kFl—-k<k+l. 


Hence, if p is a prime number for which k < p <n, then p is odd, p | n!, p { k! 
and p { (n — k)! (it cannot happen that n —k = k +1 = p, since n — k is even). 
Thus, we conclude that the product of all such primes divides 


n\ n! 
(‘) ~ kin — kV 


I[ p< @ (9.8) 


k<p<n 
p prime 


so that 


Now, our choice of k assures that k and n — k are distinct. However, since () — 
(, - 3) and such binomial numbers are summands in the binomial expansion of 2” = 
(1 + 1)", we get (1) < 2”~', and (9.8) furnishes 


[[ ps2". (9.9) 
k<p<n 
P prime 


Therefore, the induction hypothesis and (9.9) give 


I] p=( I] r)( I] p) Panam, 


p prime p prime p prime 
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We have finally arrived at the desired result, which is known as Chebyshev’s 
theorem.” 


Theorem 9.8 (Chebyshev) For each integer n > 1, there is at least one prime 
number between n and 2n. 


Proof A general argument will prove the result when n > 128. For n < 128 take 
p=3ifn=2, p=S5 ifn =3 and: 


p=7if4<n<6; 

p= 13 if 7<n < 12; 

p = 23 if 13 <n < 22; 
p = 43 if 23 <n < 42; 
p = 83 if 43 <n < 82; 
p = 131 if 83 <n < 127. 


Now, assume the result is false for some integer n > 128. In the notations of 
Lemma 9.6, such a supposition guarantees that 


Pie 


p<2n psn 
p prime Pp prime 


However, for any prime p such that on < p <1, we have 


3 2n 
<-, and 2< — <3, 
2 Pp 
< 1; it thus follows that 
2n n 2n n 
mE La) Eb lg)22-° 
ja SEP! pi P P 


On the other hand, for any prime p such that /2n < p < 2, we have p” > 2n, 


and Lemma 9.6 gives 1 < py < vp = 1. Finally, for primes p such that p < V2n, 
we have, again by Lemma 9.6, that p“? < p’? < 2n. 


3Pafnuty Chebyshev, Russian mathematician of the nineteenth century. 
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The estimates above give us (in what follows, p denotes a prime number) 
2n Lp Lp Lp 
; I] » I] I] 


p</2n J2n<ps% 2 <p<n 


[| 2 I] Pp 


p</2n J2n<p< 
Qn 
<43{ [|] 2n}, 
psV2n 


where we used Lemma 9.7 in the inequality above. 
Since 9 and the even integers greater than 2 are not primes, the condition n > 128 
(= /2n > 16) assures that 


x (Vin) <1, 


Substituting this last estimate into (9.10), we arrive at 
2. N i n 
( ") < 43 nyt < 43 (Qnyv2-!, (9.10) 
n 


Finally, note that (*) is the largest among all of the 2n + 1 summands in the 


binomial expansion of 4” = (1 + 1)?”. Since the first and last of these summands 
are both equal to 1, we get 2n (") > 4” or, which is the same, @) > . In turn, 
such an estimate, combined with (9.10), furnishes 


n 


4 A nh 
~ AF QnyV2-1, 
2n 

which yields 

27 < Qnyv3. 


J2n 


Taking natural logarithms an dividing both sides by +4", we conclude that the above 
inequality can be rewritten as 


Vv 8n log2 — 3 log(2n) < 0. (9.11) 
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Letting f(x) = V8x log2 — 3 log(2x), we have f (128) = 8log2 > 0 and 


0 


Piw= = . 


for x > 128. Thus, f is increasing for x > 128 and (9.11) is false, which gives us 
the desired contradiction. Oo 


The famous Goldbach conjecture* claims that every even number greater than 
2 can be written as the sum of two (not necessarily distinct) primes. Although a 
serious account on the state of the art on Goldbach’s conjecture is far beyond the 
scope of these notes, Chebyshev’s theorem allows us to present a simple example 
on the possibility of writing natural numbers as sums of distinct primes. 


Example 9.9 Let P stand for the set of prime numbers. Show that every natural 
number can be written as a sum of distinct elements of the set {1} U P. 


Proof Letn > | be a natural number and p be the greatest prime number less than 
or equal to n. If n > 2p, then Chebyshev’s theorem would give a prime g between 
p and 2p, which would contradict the maximality of p. Therefore, p < n < 2p 
and, hence, 0 < n — p < p. Arguing by induction, we conclude that n — p can be 
written as a sum of distinct elements of the set {1} UP, and writing n = (n— p)+ p 
we conclude that the same is true of n. Oo 


Problems: Sect. 9.2 


1. If py is the n-th prime, prove that py+1 < 2py. 

2. * Prove that, for every integer n > 1, there exists a prime p whose exponent in 
the canonical decomposition of 1! is equal to 1. 

3. (TT) Prove that, for every integer n > 1, the number 1!2!---n! is not a perfect 
square. 

4. Find all natural numbers m,n, x and y such that m,n > 1 and (m!)* = (n!)’. 

For the two coming problems, the reader will find it convenient to use the 

following stronger version of Chebyschev’s theorem: for every integer n > 6, 
there are at least two primes between n and 2n. 

5. (OCS) Find all m,n € N such that (n — 1)!n! = ml. 

6. Find all natural numbers n > | such that every natural 1 < m < n which is 
relatively prime with n is indeed a prime number. 


4 After Christian Goldbach, German mathematician of the eighteenth century. 
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9.3. Césaro’s Theorem 


In 1881, E. Cesaro” proved that the odds that two randomly chosen natural numbers 
be relatively prime is equal to S. We finish this chapter by presenting an elementary 
proof of this fact. Our exposition follows a nonpublished manuscript of prof. Hudson 
N. Lima (cf. [29]). We start by recalling a few rather simple facts on the calculus of 
probabilities. 

Let E be a finite nonempty set, said to be the sampling space, and P be a 
probability distribution in E, i.e., a function P : E — [0, 1] such that 


Peat 
xeEE 
For each x € E, we call P(x) the probability of x; also, we say that the elements 


of E are equiprobable provided 


1 
P(x) = —, VxeE. 
|E| 


More generally, an event in E is a subset X of EF, and we let its probability be 
defined by 


POj= > Fo). 


xeEX 


Back to the problem we wish to analyse, let© E = I, x I, and assume that the 
elements of E' are equiprobable, so that the probability of each one of them equals 
+r. Given n € N, let P, be the probability that a randomly chosen ordered pair 
(a, b) € E have relatively prime entries a and b. We want to show that 


lim P, =—. 
n—>oo 2 


We shall first of all find an adequate expression for P,, and to this end we begin by 
observing that P, = a where 


X = {(a,b) € I, x In; ged(a, b) = 1} 


is the event subjacent to Py. 


5Ernesto Cesaro, Italian mathematician of the nineteenth century. 
® Asin previous chapters, we set J, = {1,2,..., n}. 
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Elementary counting gives 


|X| = 2#{(a,b) € I, x In; gced(a, b) = 1 anda < b} 
—#{(a,a) € I, x In; gcd(a,a) = 1} 


n 
= 2) Ha € In; ged(a, b) = 1 anda < b}—1 
b=1 


n 
=2)/ 9(b)-1, 
b=1 
where ¢ denotes Euler’s function. Hence, 


25° gb) -1 


b=1 
n 


P, = (9.12) 


2 


We now need the following auxiliary result. 
Lemma 9.10 Jf f : N > R is any function, then for n € N we have: 
(a) iat Vogal f(@) = via f® LE]. 
(b) pai k Yo<alk fd@) = 3 a kf (k) LE LE] + 1). 


Proof Item (a) is the content of Problem 12, page 219. Concerning (b), note firstly 
that the set of ordered pairs (d, k) of integers such that! <k <nandO<d|k 
coincides with the set of ordered pairs (d, k) of integers such that | < d < n and 
k = ld, for some integer | < 1 < n/d. Hence, we can change the order of the 
involved sums, thus getting 


Yk FD =>d. Ye KF@™ 


k=1 O<d|k d=1 k=Id 
Isi<4 


=> fa (a+ 244-4 |" ]a) 


d=1 


=r 31 (s+). 


Now, recall that, according to (8.5), 


d 
pinyan yo HO 


O0<d|n 
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where « : N — R stands for the Mébius function. Therefore, item (b) of the 
previous lemma furnishes 


2 vik) =29>4 “ 


k=1 O<d\k 


=n |§ I(E |+1) (9.13) 
= ao Bi +Dnw KE 


The second summand in the last sum above can be computed with the aid of item 
(a) of the previous lemma, together with the result of Proposition 8.7. Indeed, we 
have 


IC he Yew =1, 
k=1 


k=1 0<d|k 


since Loedit i(d) does not vanish only for k = 1. It follows from (9.12) and (9.13) 
that 


Li n\2 
Pa aa D0 | § | ; (9.14) 


With the last formula above at our disposal, we can state and prove the coming 
proposition, which assures that the limit lim,—.49 P, does exist and expresses it as 
another limit which, as we shall see, can be effectively computed. We begin by 
noticing that, since |w(k)| < 1 for every k € N and are 1 a is convergent, the 
series 


is absolutely convergent, hence convergent. 


Proposition 9.11 Jf P, is as in (9.14), then limy—so0 Py exists and is such that 
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Proof Firstly, from (9.14) we get 


=e (elel 2) 


(9.15) 


In order to estimate the last sum above, we claim that, given natural numbers n 
and k such that 1 < k < n, we have 


1 1 er 2 1 
< 
k2 n2 Lk nk n2 
Indeed, 
n 1 Be 41 Be 
<< <|n- — 
k ki~k k2 k ki ~ ke 
x 1 2 nm 1 1 |) 1 
< 
k2 kn nn? n2Lkt — k2 
a2 1 1 Bi 2 1 
a 
—k2 n2Llk kn nn?’ 
as wished. 


Back to (9.15), we obtain from the above estimates that 


n n 
2 1 2 1 1 
Ge eee : 


n 
c=] 


Now, from L’H6pital’s rule we get 
Zeal. 2 | 2 
-\io <= 1+ | -dt) = —=(ogn+1)> 0 
rer k on 1 ¢ n 


as n — +00. Hence, 


and our previous estimates assure that 
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To this end, we first need to show the coming result. 


Theorem 9.12 > : = m0 
, k>1 6 


237 


The proof we present is due to professors A. Yaglom and I. Yaglom (cf. Chapter 8 
of [1]) and relies on some elementary results on complex numbers and roots of 
polynomials; Chaps. 13 and 16 cover all that is necessary. The classical alternative 
proof, relying upon the theory of Fourier Series, is delineated in problems 11 and 12 
of Section 11.2 of [8] (alternatively, see Chapter 8 of [33] or Chapter 2 of [36]). 


Proof The first of de Moivre’s formulas (cf. Proposition 13.9) gives us 
sin(n9) = Im((cos@ + i sin@)”) 
n n 
=Im > ( ios 6)"-J (sins 
J 
j=0 


4) 
es (.,". (0 9) "-2k=D (gin gy OA+D, 


2 
7 k=0 

Therefore, 
sin(n@) er n ) corey» 
(indy 2k +1 


whenever sind # 0. 
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For n = 2m + 1, we have 


sin(2m+1)0) yy (2m+1 (2m—2k) 
= 2 1) ( | )(cot6) 


(sing) 27+) 2k + 


Thus, letting 


k 2m +1 m—k 
f@= De 1) Cos ; 


we have shown that 


sin((2m + 1)0) 


2) _ 
f((cot@)*) = (sin) Om) (9.16) 
Setting 0 = (x45) fork = 1,2,...,m, it is immediate from (9.16) that 
(cot 01)?, (cot @2)*, ..., (cot,)* are m distinct roots of f. However, since f has 


degree m, we conclude that those are all of its complex roots. Now, the Girard-Viéte 
relations between coefficients and roots of a polynomial (cf. Proposition 16.6) give 


2m+1 
2m —1 
(cot 6})* + (cot 2)? + +++ + (cotO)* = ( 3 ) = m(2m ) 


From this, basic Trigonometry allows us to compute 


(csc 01)? + +++ + (cSCOm)? = (1 + (cot)7) +--+ + (1 + (cotO,)*) 


=m + ((cot61)? +++ + (cot n)”) 
m(2m — 1) Le 2m(m + 1) 
3 ~ 3 


Finally, since 0; € (0, =) for 1 < j <m, we have for 1 < j < m the inequalities 
sin6; < 0; < tan6; 


or, which is the same, 


1 


(cot6;)* < < e < 


< (csc 0; ay", 
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Therefore, 
une) De se 5 % mS 1) 
=2 (cot; )" < > z < 2 (csc 0; )° = ——.~—— 


and, taking the definition of the 0; into account, 


no) =) = a a WY _ 2mm + Dy 
= 3 


Multiplying the last inequalities above by os letting m — +00 and noticing 


aa 


that 
m(2m — 1) . 2m(m + 1) 1 
im ——— = lim ———|S =.-, 
m>oo 3(2m + 1)? m>oo 3(2m + 1)2 6 
the squeezing principle for limits of sequences gives us the desired result. Oo 


Theorem 9.13 If py stands for the n-th prime number, then 


Proof Since py > n for every n € N, the fundamental theorem of arithmetic gives 
(upon expanding the right hand side below) 


Seid ae (145+ +=) 
k2 — pe pi ant 2 21 


Pn Pn 


Pa 
PES oes (145+ +a)<¥Z 
Py pip Pa Pel k?" 


1 


once more from the fundamental theorem of arithmetic, together with the fact that 
each summand obtained from the expansion of the products at the left hand side is 
also present at the right hand side. 

Since 
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we obtain the set of inequalities 


we get 


= 
ne 
IA 
Cs 
— 
| 
S| = 
~~ 
| 
emo 
— 
| 
x)= 
NS 
| 
| 
uM 
Il - 


k=1 


ou, which is the same, 


Finally, it now suffices to let — +00 and invoke the squeezing theorem. Oo 
Theorem 9.14 If py denotes the n-th prime number, then 


am TT(-4) =" 5 
k=1 


Px 


Proof In all that follows, we make the following convention on indices: 1 < m > 
it < im. Givenn EN, let: 


¢ Ay, be the set of natural numbers of the form pj, pi, ... Pi,,, with s € N and p;,, 


Pin» +++» Pir, being prime numbers such that pj, pi, ... Pir, <3 
¢ A, be the set of natural numbers of the form p;, p;, ... Pir,, With s € N, p;, and 
Pins +++» Pir, being primes such that 7), i2,...,i25 <n; 


¢ B, be the set of naturals of the form pj, pi, ... Pir,,,, With s € N and pj,, pis, 
. + Pizy,; being prime numbers such that pj, Piz... Pir.) S15 


° Bn be the set of naturals of the form pj, Pi, .-. Pir,,, With s € N and pj,, pi, 
.+ Pin,,, being prime numbers such that i), i2,..., i241 <5 


9.3. Césaro’s Theorem 241 


Let also 


Moreover, the limits lim a,, lim b,, lim a, and lim by, all exist, for the corresponding 
sequences are nondecreasing and bounded above by )° 7°, ra In particular, this 
reasoning assures the existence of the limit 


Z 1 
lim 1——)}. 
am TT (1- 4) 


Since p, > n for every n € N, we have Ay, C Ay C Ap; py...p, and By C Bn Cc 
Bp, po...pn» Lherefore, the following inequalities are valid: 


An < An Sap, py...p, ANd bn < bn < Vpypy...-9+ 
Hence, once more from the squeezing theorem, we get 


lim a, = lim ad, and lim by, = 
n—>oo n—>oo 


lim Dy, 
noo 00 


n 


so that 


n 
1 : 
lim I] 1——) = lim @ — lim by, = lim a, — lim by 


II 
5 


aad sale 
=> ">. 


as wished. oO 
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We can finally gather together the results above to compute 


k>1 k=l Pr 
=| lm I] ( — =) 
2 
fap ee Px 


Chapter 10 m®) 
The Relation of Congruence sei 


In this chapter, we define and explore the most basic properties of the important 
relation of congruence modulo n > 1. Our central goal is to prove the famous 
Fermat’s little theorem, as well as its generalization, due to Euler. The pervasiveness 
of these two results in elementary Number Theory owes a great deal to the fact that 
they form the starting point for a systematic study of the behavior of the remainders 
of powers of a natural number a upon division by a given natural number n > 
1, relatively prime with a. We also present the no less famous Chinese remainder 
theorem, along with some interesting applications. 


10.1 Basic Definitions and Properties 


The central object of study in this section is the relation on Z defined as follows. 


Definition 10.1 Let a, b and n be given integers, with n > 1. We say that a is 
congruent to b, modulo n, and denote a = b (modzn), provided n | (a — b). If a is 
not congruent to b modulo n, we write a € b (modn). 


Examples 10.2. According to the above definition, we can write: 


(a) 3 =5(mod2), for 2 | (3 —5). 

(b) —1 = 11 (mod 12), for 12 | (-1 — 11). 
(c) 2 =—1(mod3), for 3 | (2— (—1)). 
(d) x = —x (mod 2), for 2 | (x — (—x)). 
(ec) 1 #2 (mod 3), for 3} (1 —2). 

(f) 20 # 15 (mod 7), for 7 + (20 — 15). 


What are we really looking at when we consider congruences modulo n? In order 
to answer such a question, let us observe what happens with the integers modulo 4, 
for instance: 
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4k =0(mod4), 4k +1 = 1(mod4), 
4k +2 =2(mod4) and 4k + 3 = 3(mod4). 


Thus, the list ..., —5, —4, —3, —2, —1, 0, 1, 2, 3, 4, 5, ...of integers is equal, 
modulo 4, to 


..., 3,0, 1,2, 3,0, 1,2,3,0,1,... 


(with 0 corresponding to 0, of course), and we readily see that every integer is 
congruent, modulo 4, to its remainder upon division by 4. This result still holds 
true in general, as shown by the following result. 


Proposition 10.3. Let a and n be given integers, withn > 1. 


(a) If a leaves remainder r upon division by n, then a = r (modn). In particular, 
every integer is congruent, modulo n, to exactly one of the numbers 0, 1, 2, ..., 
n—-2,n—1. 

(b) a=b(modn) © a and b leave equal remainders when divided by n. 


Proof 


(a) Assume that a leaves remainder r upon division by n. Then, the division 
algorithm gives a = gn + r, for some integer q, and this gives that n | (a — r). 
But this is the same as writing a = r (modz). The rest is immediate. 

(b) If a = b(modn), then n | (a — b) and Corollary 6.9 (with a and b in place of 
a, and az, and n in place of b) shows that a and b leave equal remainders when 
divided by n. Conversely, if a and b leave the same remainder r upon division 
by 1, we can write a = nq, +r and b = nq2 +r, with q1, g2 € Z. Therefore, 
a—b=n(q — q2), so that n | (a — b). Hence, a = b(modn). 

| 


Remark 10.4 The definition of congruence modulo n excludes modulus n = | for, 
otherwise, a = b(mod1) would be a synonym of | | (a — b), which is always 
true. Hence, any two integers would be indistinguishable modulo 1, so that such a 
relation would be useless, as far as divisibility is concerned. 


Since congruence modulo n only sees remainders upon division by n, the reader 
may well be asking himself/herself what advantage(s) (if any) do we have in using 
it. As we shall see in a while, the first profit is computational in nature. Indeed, in 
the two coming propositions we prove some elementary properties of congruences 
that will allow us, for instance, to mechanically compute the remainder we get upon 
dividing 177°? by 13. Such a task is not so easy to accomplish by using the methods 
we developed so far. 


Proposition 10.5 Given integers a, b, c and n, with n > 1, we have: 


(a) a=a(modn). 
(b) a=b(modn) > b =a (modn). 
(c) a= b(modn) andb=c (modn) > a = c (modn). 
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Proof Items (a) and (b) are immediate. Concerning (c), if a = b (modn) and b = 
c(modn), then n | (a — b) andn | (b —c), and item i. of Remark 6.6 assures that n 
also divides a—c = (a—b)+(b—c). But this is the same as having a = c (modn). 

Oo 


Proposition 10.6 Leta, b, c, d, m and n be given integers, withm,n > 1. 


(a) Ifa = b(modn) and c = d(modn), thena+c = b+d(modn) and ac = 
bd (mod n). In particular, ac = bc (modn). 

(b) Ifa = b(modn), then ak = bk (mod n) for every k €N. 

(c) If co, C1,---,Cm € Zand f(x) = cmx™ +--+ + ¢1x + co is a polynomial 
function, then 


a =b(modn) > f(a) = f(b) (modn). 


(d) Ifa = b(modn), then gcd(a,n) = gcd(d, n). 

(e) Ifa+c=b+c(modn), thena = b(modn). 

(f) Ifac = bc (modn) and gcd(c,n) = d, then a = b (mod a) In particular, if 
gcd(c,n) = 1, thena = b(modn). 

(g) Ifa = b(modmn), then a = b(modm) anda = b (modn). 

(h) If a = b(modn) and a = b(modm), with gcd(m,n) = 1, thena = 
b (mod mn). 


Proof 


(a) Since (a+ c) — (b+ d) = (a—b)+(c—d),ac — bd =a(c—d)+(a—b)d 
and n | (a—b),n | (c —d), it follows from item (c) of Proposition 6.5 (see also 
item i. of Remark 6.6) that n | [(a +c) — (b+ d)] and n | (ac — bd). But this 
is the same as having a ++ c = b+d (modn) and ac = bd (mod zn). Finally, the 
particular case follows from c = c (modn). 

(b) Letting c = a andd = b in the second part of item (a), we get a* = b* (modn). 
On the other hand, had we already shown that a' = b! (modn) for some / € N, 
then, once more from the second part of (a) (this time with c = a’ andd = b'), 
we obtain 


at! —q.q@' =b-b! =b'*! (modn). 


Item (b) follows, then, by induction on k. 
(c) If a= b (modzn), then we get from items (a) and (b) that cpak = cxb* (mod n), 
for 0 < k < m. Hence, it follows from Problem | that 


m m 


f(a) = Yo cea* = iar = f(b) (modn). 


k=0 k=0 
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(d) Since a = b (modn), there exists g € Z such that a = b+ nq. We then wish to 
show that 


gcd(b + ng,n) = gcd(b, n). 


But this is immediate from item (b) of Proposition 6.22. 

(ce) Ifa+c=b+c(modn), then n divides (a + c) — (b +c) = a — b, which is 
the same as a = b (modn). 

(f) Let n = dn’ and c = dc’, with c’ and n’ being relatively prime integers. From 
ac = bc (modn) we get (dn’) | [dc’(a—b)] or, which is the same, n’ | c'(a—b). 
However, since gcd(n’, c’) = 1, item (a) of Proposition 6.22 gives n’ | (a — b). 
Since n' = 4, this is the same as a = b (mod 4). The rest is immediate. 

(g) Ifa = b(modmn), then mn | (a — b), so that m | (a — b). This last relation is 
equivalent to a = b (modm). Analogously, a = b (modn). 

(h) Since m,n | (a — b) and gcd(m,n) = 1, it follows from item (d) of 
Proposition 6.22 that mn | (a — b), which is exactly what we wanted to prove. 

Oo 


We are now in position to solve the example below. 
Example 10.7 Compute the remainder we get upon dividing 177° by 13. 
Solution Since 17 = 4 (mod 13) and 16 = 3 (mod 13), item (b) of the previous 
proposition gives 


172002 = 42002 = 161001 = 31001 (mod 13). 


Now, observing that 33 = 1 (mod 13) and applying items (a) and (b) of the same 
proposition, we obtain 


31001 = 32 . 3999 =—9. (ah eae =9. 133 = 9(mod 13). 
Then, Proposition 10.3 guarantees that 177°°? leaves remainder 9 when divided 


by 13. o 


The elementary properties of congruence deduced in Proposition 10.6 allow us 
to establish the criterion of divisibility by 9 in a much simpler way that that hinted 
to in Problem |, page 162. This is our next 


Example 10.8 A natural number n and the sum of the algarisms of its decimal 
representation leave equal remainders upon division by 9. 


Proof Letting n = (agag_ 1 ...a1d40)19 be the decimal representation of the natural 
number n, we have 


Wa, 10" + op 10"! 4 <>< a 10-4 ay. 
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Since 10 = 1 (mod 9), item (c) of Proposition 10.6 (with f(x) = agx* +ag_yx*—!4+ 
-- + a,x + ao) gives 


n= f(10) = fl) = ag + ag-1 + +++ + a1 +49 (mod 9). 


The rest follows from item (b) of Proposition 10.3. oO 


As an additional example of the computational simplification the congruence 
relation brings (and for future reference), we shall prove once again the results 
of Corollary 6.8 and Example 6.10. Before that, however, it is time we make the 
following simple remark. 

As we already know, every integer is congruent to 0, 1, 2, 3, 4, 5 or 6, modulo 7; 
also, since 


= —3(mod7), 5 =—2(mod7) and 6 = —1(mod7), 


we conclude that every integer is congruent, modulo 7, to one of 0, +1, +2 or +3. 
On the other hand, every integer is congruent to 0, 1, 2, 3, 4, 5, 6 or 7, modulo 8; 
however, since 


= —3(mod8), 6 =—2(mod8) and 7 = —1(mod8), 


it follows that every integer is congruent, modulo 8, to 0, +1, +2, +3 or 4. The 
advantage of replacing, modulo 7, the integers from 0 to 6 by 0, +1, +2, +3 lies 
on the fact that, if in some context one needs to raise the remainders upon division 
by 7 to some exponent k, then one is likely to have much less work by using 0, +1, 
2, 43 in place of 0, 1, 2, 3, 4, 5, 6, for (—x)* = +x". By the same token, it is 
sometimes advantageous to replace, modulo 8, the usual remainders 0, 1, 2, 3, 4, 5, 
6, 7 by 0, £1, 2, +3, 4. 

Generalizing the discussion of the previous paragraph, it is not hard to verify 
that: 


_ 


i. Ifn = 2k, then every integer is congruent, modulo n, to one of 


0, +1,42,...,4(k—1),k. 


ii. Ifm = 2k + 1, then every integer is congruent, modulo 7, to one of 


O, LED, oe ey teks 


We shall go one step further in formalizing the above discussion when we study 
the concept of complete residue systems, in Sect.11.1. For the time being, we 
establish the above mentioned results. 
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Proposition 10.9 For every a € Z, one has: 


(a) a* =0,1,4,5, 6 or 9 (mod 10). 
(b) a* =0 or 1 (mod3). 

(c) a* =0 or 1 (mod 4). 

(d) a2 =0,1 or 4(mod 8). 

(e) a* =0 or 1 (mod 16). 


Proof 


(a) Modulo 10, we have a = 0, +1, +2, +3, +4 or 5, so that 


a’ = 0°, (£1)”, (+2), (+3)*, (+4)* or 5” (mod 10) 


or, which is the same, a? = 0, 1, 4, 9, 6 or 5 (mod 10). However, since the last 
digit of a natural number is equal to the remainder we get upon dividing it by 
10, it follows that the last digit of a? is equal to one of 0, 1, 4, 5, 6 or 9. 

(b) We know that a = 0 or £1 (mod 3), so that a* = 0? or (£1)? (mod 3). Hence, 
a? = Oor | (mod3). 

(c) Since a = 0,+1 or 2(mod4), we get a = 07, (+1)? or 27 (mod 4). Since 
2? = 0 (mod 4), it follows that a” = 0 or 1 (mod 4). 

(d) As in the previous items, we have a = 0, +1, +2 


3 or 4 (mod 8), and then 


a? = 0°, (£1)”, (+2)”, (+3)? or 47 (mod 8). 


Now, 3” = 9 = 1 and 4? = 16 = 0(mod 8), so that a? = 0, 1 or 4 (mod 8). 


(e) By item (d), we have a* = 8g +r, with g € N andr = 0, 1 or 4. Hence, 


wm 


at = (8q +r)* = 16(4q7 +. gr) +r? = 16q'+0 or 16q’ +1, 


with g’ EN. 
o 


The coming examples show how we can use what we have learned up to this 
point about congruences to solve various interesting problems. Generally speaking, 
the solutions we shall present will be a mixture of a judicious use of one or 
more congruence relations, together with other ideas, each of which adapted to 
the situation at hand. The reader should make an effort to perceive exactly how 
each used tool helps in paving the way to the final solution, for this attitude will be 
of great help in preparing him/her to tackle the problems posed at the end of the 
section. 


Example 10.10 (Italy) Find all x, y € N for which x* + 615 = 2”. 


Solution Looking at the given equation modulo 3, we obtain 


x? + 0 = (—1)” (mod 3). 
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Now, by item (b) of Proposition 10.9, we have x = 0 or | (mod3), so that the 
congruence above gives us the following possibilities: 


0 = (-1)” (mod 3) or 1 = (—1)” (mod 3). 


The first one clearly never occurs. In what concerns the second, if y is odd, we get 
1 = —1(mod3), which never happens too. Hence, y must be even, say y = 22, 
with z € N. The stated equation can then be written as 


615 = 2% — x? = (2% — x)(27 + x). 


Finally, since 2* + x > 2 — x and 615 = 3-5-41, we are left to analysing the 
linear systems below: 


’ 


2%+x=615 2% +x = 205 
BZo-x=1 7 |2%-x=3 


2% +x = 123 ay 2+x=41 
2—x=5 2Z—x=150 


To this end, in each case we add both equations to obtain 22+] — 616, 208, 128 
or 56, respectively. However, since 2°*! is a power of 2, the only allowed possibility 
is 2°+1 = 128, so that 


2 +x = 123 
2Z—-—x=5 — 


Then, z = 6 and x = 59, and the only solution to the original equation is x = 59 
and y = 2z = 12. Oo 
Example 10.11 (OIM) Find all m,n € N such that 2” + 1 = 3”. 

Solution If m = 1, then clearly n = 1. If m > 2, then 2” = 0 (mod 4) and, looking 
at the equation modulo 4, we obtain 


1 = (—1)" (mod 4). 


It follows that n must be even, say n = 2u, with u € N. Substituting this into the 
given equation, we get 2” + 1 = 37” or, which is the same, 


Pie = (3" = 1)(3" + 1). 
In view of the Fundamental Theorem of Arithmetic, the equality above assures 
that both 3“ — 1 and 3” + 1 must be powers of 2. To get further information, let 


d = gced(3” — 1, 3” + 1) and observe that 


Z(G" d= 3" = 1), 
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i.e., d | 2; however, since 3“ — 1 and 3” + | are both even numbers, we have d = 2. 
Now, the only way the gcd of two powers of 2 can be equal to 2 is when the smallest 
one is itself equal to 2, so that the only possibility is to have 


3% 1 =2 
41-2") 
Thus, uv = 1, m = 3 and, hence, n = 2. oO 


Example 10.12 (OCM) Do the following items: 


(a) Prove that there does exist x, y, z € N such that 13x+ + 3y4 — z+ = 2013. 
(b) Prove that there does not exist x, y, z € N such that 13x4 + 3y* — z+ = 2014. 


Proof 


(a) Letting z = 2x, we get ag = 671 or, which is the same, 07 =2)07" +37) = 
11-61. Hence, the Fundamental Theorem of Arithmetic gives us the possibilities 


so that x = 5, y = 6 and z = 2x = 10. 

(b) By the sake of contradiction, assume that a solution does exist. Item (e) of 
Proposition 10.9 gives at = 0 or 1(mod8), for every a € Z. Therefore, 
13x4 + 3y+ — z+ = 0,2, 4,5 or 7 (mod 8). However, since 2014 = 6 (mod 8), 
we have reached a contradiction. 

Oo 


The reader will probably notice that the solutions of our last two examples are 
somewhat more terse than those of the previous ones. An effective training for the 
proposed problems is to fulfill the omitted details. 


TM p.2n 
f moe E 


Example 10.13 (Romania) Let m,n, p € N, where p is an odd prime. I Tp 


N, prove that such a number is itself prime. 


7m p: gn 


Tm pF . Since 


Proof Leta = 


ey ae 2.7m 
1+ ; 
7m — p-2n 7m — p.2n 


it follows that (7” — p-2”) | gcd(p-2”t!, 2-7"). Now, there are two possibilities: 


(i) p = 7: in this case, 
(7” —7-2") | gced(7 - gn) 2.7" )\= 14, 


which gives 7”~! — 2” = 1. Analysing such an equation modulo 3, we reach a 
contradiction. 
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(ii) p #7: in this case, we get gcd(p-2”*+!, 2-7") = 2 and, hence, (7” — p-2”) | 2. 
Again, this implies 7” — p- 2” = 1 and, looking at such last equality also 
modulo 3, we obtain 1” — p- 2” = 1 (mod3), so that p = 3. It follows that 
7” —3-2"=l1anda=7"+3.-2". 

If m = 1, then n = | and, hence a = 13, a prime number. If m > 1, then 
n > 1 and 

1 i qm — 1 

aa: 


It follows that m is even, say m = 2k. Thus, we get 49 — 1 = 3- 2” and 
distinguish two subcases: 


Of. Sa ie, wate a4 


e Ifk = 1, thenm = 2,n = 4 anda = 97, once more a prime number. 
* Ifk > 1 then, factoring 49* — 1, we get 


AQP) 4 eres Os a DPF 


Such an equality gives k even and, writing k = 21, we obtain 


3-2" = 407 _ 1 = (-1)~’ — 1 = 0(mod5), 


which is an absurd. 
oO 


Example 10.14 (BMO) Let (an)n>1 be the sequence defined forn > 1 by a, = 
2” + 49. Find all integers n > 1 for which a, = pq and ay, = rs, where p,q, Tr, 
s are primes such that p <qg,r <sandq-—p=s-r. 


Solution Let be a natural number for which a, and a,+, satisfy the stated 
conditions. If p,r > 3, then we get p,q,r,s = 1 or 5(mod6). However, since 
52 = 1(mod6), in any case we obtain pq,rs = 1 or 5(mod6). On the other 
hand, if k denotes the odd element of {n,n + 1}, then a= 2 (mod 6), whence 
2449 =24+1=3 (mod 6), which, in turn, is an absurd. Thus, we must have 
Pp <3orr < 3, so that at least one of p, r must be equal to 3 (for a, and ady41 are 
both odd). 

If p > r,theng =s+p-—r > s, implying that a, = pg > rs = ay41,a 
contradiction. Therefore, p < r, and hence p = 3 andg = 3+ 5s — r. We thus get 
an = pq = 3(33 +s —1r). On the other hand, we also have 


Qan = 2"t! 4.98 = (2"*! 4. 49) +. 49 = ayy +49 = rs +49, 


so that 6(3 + s —r) =rs +49. This gives r = 6 — fo, and hence s = 61,r = 5, 
q = 59. Then, 


3-59 =a, = 2" +49 > 2" = 128 Sn =7. 
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Problems: Sect. 10.1 


1. * Generalize item (a) of Proposition 10.6, showing that if m,n > 1 are natural 
numbers and aj,..., 4m, D1, ..., bm are integers such that a, = by (modn) for 
1<k<~m, then 


m m m m 
S > ae = D> bg (modn) and | | ax = | | dy (modn). 
k=1 k=1 


k=1 k=1 


2. Let n = (apag_1...a1a9)19 be the decimal representation of the natural 
number. Prove that the remainder of n upon division by 11 is equal to that of 


iy = Gi is — ee (1) a Ee, 


. Find the remainder of the division of 1000°° + 55!0° by 7. 

. Show that 8190 + 379°! is not a perfect square. 

. Find the remainder upon division of Be by 5. 

. Letn € N be such that n = —1 (mod 4). Prove that there exists a prime p such 
that p | n and p = —1 (mod 4). 

7. * Use the fact that 24 + 54 = 641 = 27. 5 + 1 to show that 2” + 1 is nota 

prime number. 
8. (IMO) Find the smallest natural number 7 satisfying the two following 
conditions: 


Nm WW 


(i) The last digit of the decimal representation of n is 6. 
(ii) If we erase this last digit 6 and write it immediately to the left of the first 
digit of n, we obtain the number 47. 


9. Find all positive integers n such that 7 | (2” + 3”). 
10. Find all prime numbers p and q for which p* + 3pq + q? is a perfect square. 
The coming three problems are concerned with the Fibonacci and Lucas 
sequences, so the reader may find it convenient to start by recalling (cf. 
Sects. 1.3 and 3.1) that the Fibonacci sequence (F;,)n>1 is given by Fy = Fy = 
land Fy42 = Fy41 + Fe, whereas the Lucas sequence (Ly,),>1 is defined by 
L, = 1, Lz =3 and Ley = Ley + Lx, for every integer n > 1. 
11. Prove the following properties of the Fibonacci and Lucas sequences: 


(a) 2| Fx &@3|nand4| F, 3 6|n. 

(b) 2| Ly, @3|nand4|L, @n = 3(mod6). 
(c) 3| Ln @n =2 (mod 4). 

(d) Ly = 3(mod 4) if 2 | n and 3 {n. 

(e) Ly+12 = Ln(mod 8), for every n € N. 


12. Also concerning the Fibonacci and Lucas sequences, prove that 


1, if 3tn 


d(F,, Ln) = ‘ 
ge ( no n) {3 if 3|n 


10.1 


13. 


14. 


15. 
16. 


17. 
18. 


19. 
20. 


21. 
22. 


23. 


24. 
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Prove the following properties of the Fibonacci and Lucas sequence, for every 
m,n, k € N such that 2 | k and 3k. 


(a) 2F, ta = Finly + Fal. 

(b) 2Lintn = 5 Fin Fn + Linky. 

(c) Fint2nr = (—1)' Fin (mod Lx), for every t € N. 
(d) Lin+oxn = (—-1)' Lm (mod Lx), for every t € N. 


(USA) Prove that the equation ey + xs + a3 Se ae = 15,999 does not 
have any integer solutions. 

(Romania) Find all natural numbers m, n, k such that 2” + 3” = k?. 

(Soviet Union) Find all solutions in integers m,n, p > | of the equation 


I+ 2!4+---+n!=m?. 


(Bulgaria) Find all positive integers x, y and z for which 3* + 4” = 5%. 
(Tchecoslovaquia) Find all positive integers x, y and p such that p is prime and 
ptr y? = 1. 

Find all a, b, c € N such that a and b are even and a? + b% = 2°. 

(Hungary) Let a and b be given naturals and n be a nonnegative integer such 
that a” | b. Prove that a”*! divides (a + 1)? — 1. 

(Brazil) Find all positive integer solutions of the equation x* + 15¢ = 2°. 


101992 6 . 
10847 | Justify your 


(France) Compute the digit of units of the integer part of 
answer. 

(China—adapted) The objective of this problem is to solve equation 7* —3” = 4 
in natural numbers, showing that x = y = 1 is its only solution. To this end, do 
the following items: 


(a) Use modulo 8 to conclude that there are no solutions if x is even. 

(b) If x is odd and y > 1, look at the equation modulo 9 to show that x = 
2 (mod 3) and, hence, that x = 5 (mod 6). 

(c) Writing x = 6g + 5, with g € N, conclude that 7* = +2 (mod 13). 

(d) Show that every power of 3 is congruent to 1, 3 or 9, modulo 13. 

(e) Assuming that y > 1, use items (c) and (d) to reach a contradiction. 


(Bulgaria—adapted) The purpose of this problem is to show that the only 
solution of the equation 5*7” + 4 = 3¢ in nonnegative integers x, y and z 
is x = 1, y =O and z = 2. To this end, do the following items: 


(a) If x = 0, the equation reduces to 7” + 4 = 3, so that z > 2. Use modulo 9 
to show that, in this case, there are no solutions. 

(b) From now on, assume x > 1. Use modulo 5 to conclude that z = 2r, for 
some t EN. 

Show that gcd(3’ — 2, 3 + 2) = 1. Then, conclude that: (i) 3’ — 2 = 1 and 
3° +2 = 5°79, (ii) 3’ — 2 = 5* and 3’ + 2 = 7° or (iii) 3° — 2 = 7” and 
3°+2= 57. 

In case (11), use modulo 3 to show that there are no solutions. 


(c 


Nee 


(d 


wm 
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(e) In case (iii), we have 5* — 7” = 4. From this, use modulo 4 to conclude 
that y is even. Finally, if x > 2, use modulo 25 to get a contradiction. 


25. (Miklé6s-Schweitzer—adapted) In order to solve equation (x + 1)» —x* = lin 
integers x, y, z > 1, do the following items: 


(a) Look at the equation modulo x + | to conclude that z is odd. 

(b) Write (x+1)” = x*+1 and factor the right hand side to show that x is even. 
Then, write x*~! = —_ and factor the right hand side to conclude that 
y is also even. : 

(c) If x = 2s and y = 2, with, t € N, show that (x +1)! —1 and (x+1)'+1 
have gcd equal to 2. From this, use modulo x to show that (x-+1)'—1 = 2s% 
and (x + 1) +1 = 2271, 

(d) Use inequality 2s? < 2! to show that s = 1. Then, successively get 
x=2,t=1,y=2andz=3. 


10.2. The Theorems of Euler and Fermat 


The effective use of congruences to compute remainders is considerably simplified 
if we find exponents which make a certain power congruent to |. For instance, 
knowing that 7> = 1(mod9) it rests much easier to compute the remainder of 
25100! upon division by 9: since 25 = 7 (mod 9), we have 


251001 = 71001 = (P79. 7? = 1° . 49 = 4 (mod 9). 


In this direction, given relatively prime integers a and n, withn > 1, the purpose 
of this section is to find an exponent k € N for which 


a‘ = 1(modn). 


To this end, we shall first look at the case in which n is prime, thus proving one of 
the most important results of the elementary theory of congruences: Fermat’s little 
theorem. ! 


Theorem 10.15 (Fermat) Givena, p € Z, with p prime, we have a? = a (mod p). 
In particular, if gcd(a, p) = 1, then 


a?! = 1 (mod p). (10.1) 


Proof If a? = a(mod p), then p divides a? — a = a(a?~! — 1). Hence, if 
gcd(a, p) = 1, then (10.1) follows from item (a) of Proposition 6.22. 


'A combinatorial proof of Fermat’s little theorem was the object of Problem 5, page 57. 
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It thus suffices to show that a? = a (mod p), for every a € Z. If p = 2, there is 
nothing to do, for a? — a = a(a — 1), being the product of two consecutive integers, 
is always even. Let us now assume that p > 2 and prove the desired result, firstly for 
a > O, by induction on a. For a = 1, there is nothing to do. Assume, by induction 
hypothesis, that the theorem holds for a certain natural value of a, i.e., assume that 
kP =k (mod p), for some k € N. Fora = k + 1, we have 


p-1 

k+ I? -k+D= 0-4 (P)ar, 
hy 
j=l 


However, since p | (k? —k) (by induction hypothesis) and p | (7) forl < j < p-l 
(by Example 6.42), it follows that p divides (k + 1)? — (k + 1), so that (K+ 1)? = 
(k + 1) (mod p). 

Let us now look at the case a < 0 and p odd. If a = 0, once more there is nothing 
to do. If a < 0, then, since p is odd, what we did above gives 


a? = —(—a)? = —(—a) =a (mod p). 


| 


Before we extend Fermat’s little theorem to a composite modulo, we collect a 
few examples that illustrate some applications of it. 


Example 10.16 If p and q are distinct primes, prove that pg divides p?—!+q?~!-1. 


Proof Since p and gq are distinct primes, we have gcd(p,q) = 1. Hence, by 
Fermat’s little theorem, g divides p?~! — 1. However, since q obviously divides 
q?—', we conclude that g divides g?~! + (p7~! — 1) as well. Analogously, p 
divides p?—! + (q?—! — 1). Finally, since p and q both divide p?~! + q?-! — 1 
and are relatively prime, item (d) of Proposition 6.22 guarantees that pq divides 
pi-! 4 qP-! — 1 too. Oo 


Example 10.17 (Romania) Let p and q be prime numbers, with g 4 5. If qg | Q?+ 
3?), prove that q > p. 


Proof Since q | (2? +3”), we clearly have q 4 2,3; since g # 5 by hypothesis, 
we thus have g > 5. Hence, we can assume that p > 3. If g < p, then g — 
1 < p, so that g — 1 and p are relatively prime. In such a case, Bézout’s theorem 
assures the existence of x, y € N such that px = (gq — 1)y + 1. Therefore, from 
2? = —3?(modq) we successively get (2?)* = (—3”)* (mod q) and (since —3? = 


(=3)") 


2@-Dy+] = (-3)@-D9+! nod g). 


256 10 The Relation of Congruence 


Finally, since ¢ 4 2,3, Fermat’s little theorem gives 27—!, (—3)?—! = 1 (modq); 
then, the above congruence reduces to 2 = —3(modq), so that gq = 5. Finally, such 
a conclusion is obviously a contradiction. oO 


Example 10.18 (BMO) Let p > 2 be a prime number such that 3 | (p — 2), and let 
S={y*-x°-1; 0<x,y < pandx,y €Z}. 


Prove that S' contains at most p — | multiples of p. 


Proof If0 < u,v < p—1 and w=v (mod p), we claim that u = v. To this 
end, start by noticing that u? = v? = 0 (mod p) if and only if u = v = 0; on the 
other hand, for 1 < u, v < p—1, Fermat’s little theorem, together with the fact that 
p—1=3k+1, give 


wkt! = y3*+] (mod p). (10.2) 


Now, 


u> = v* (mod p)=> wrk = y>* (mod p), 


and it comes from (10.2) that 


ES Si (mod p). 


Finally, cancelling out u* from the above congruence, we obtain u = v (mod p); 
condition | < u,v < p — | thus implies that u = v. 
The above discussion guarantees that 


{x7 02% = p—1} ={0,1,...;p— 1}. (10.3) 

On the other hand, y* — x* — 1 € S is a multiple of p if and only if y* = x? — 

1 (mod p). Hence, it follows from (10.3) that, for each O < y < p — 1, there exists 
a single 0 < x < p —1 for which p divides y* — x3 — 1. 

The above reasoning would provide us with at most p multiples of p in S. 

Nevertheless, note that 0 = 1* — 0? — 1 = 33 — 23 — 1 is represented twice in 

S, so that S actually contains no more that p — 1 multiples of p. Oo 


We generalize Fermat’s little theorem with the following result of L. Euler, which 
brings Euler’s function g into prominence. For its statement, notice that if a* = 
1 (modn), then item (d) of Proposition 10.6 assures that gcd(a*, n) = gced(1,n) = 
1; in particular, gcd(a, n) = 1. 


Theorem 10.19 (Euler) Jf a and n are relatively prime integers, withn > 1, then 


a?™ = 1 (modn). (10.4) 
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Proof Let p be a prime factor of n and k be a natural number. We start by making 
induction on k to show that 


a?) = | (mod p*). (10.5) 


Case k = 1 reduces to Fermat’s little theorem. Now, assume, by induction 
hypothesis, that ger) = | (mod p!), for some natural /. Then, gee) — p'qg+1 for 
some integer g, and we have 


+1) 


a?) 7 a aPeP) jn (a?(P')yP 1 


Pp 
=(p'¢+1)?-1=)> (°) (p'q)i - 1 


j=0 


Dp 
7 (i) +2 ( Jet! 
ja 


= p! Maroy (fer Mod, 


= 0(mod p't!), 


since 2/ > 1+ 1 for every integer / > 1. 
In order to finish the proof, let n = p}'' p53”... pr‘ be the canonical decomposi- 
tion of n into powers of distinct primes, so that (cf. Theorem 8.12) 


p(n) = v(pt')e(p3").-- 9(p;"). 


—_———_ 


Letting mj; = o(p}') ears (P;') ae p(P,*) (where the ~ above a factor means that 


it is omitted from the corresponding product) we have g(n) = g( p )mj, and it 
follows from (10.5) that 


a; mj , 
q?™ — (ae) = yi = 1 (mod De): 


However, since the powers P;! are pairwise relatively prime, item (h) of Proposi- 
tion 10.6 guarantees that a?™ = 1 (modn). oO 


As in the case of Fermat’s little theorem, we pause to see, in a few examples, 
some interesting applications of Euler’s theorem. 


Example 10.20 (Brazil) Prove that there exists an integer k > 2 such that the 
number 199...91 is a multiple of 1991. 
—_=— 


k 
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Proof Notice firstly that 
199...91=2-104+!_9, 
>’ 
k 


Hence, we want to find an integer k > 2 such that 
2- 104+! = 9 (mod 1991). 
To this end, since 2 - 10° = 9 (mod 1991), we have 


2-10'+! = 9 (mod 1991) = 2- 10+! = 2- 10° (mod 1991) 
& 10‘? = 1 (mod 1991). 
For what is left to do, note that 1991 = 11-181 and 181 is prime (by Eratostenes’ 
sieve, for instance). Hence, 9(1991) = g(11) - g(181) = 10- 180 = 1800, and 


Euler’s theorem gives 10!800 = | (mod 1991). Therefore, it suffices to take k — 2 = 
1800. oO 


Example 10.21 (Romania) Let a and n be given natural numbers. Show that we can 
choose n pairwise relatively prime elements from the set 


ADIGE 16 a HL ae = 1, 


Proof By induction, let k € N and assume that we have already established the 
existence of a subset Bg C A such that | B,| = k and the elements of By; are pairwise 
relatively prime. Let 


m= [| «- 


xe Br 
Since m = 1 (moda), we have gcd(a,m) = 1. Therefore, Euler’s theorem gives 
a?) = 1 (modm). 
If y = a?! 4 ge” _ |, then y € A. We claim that gcd(y, x) = 1 for every 


x € Bx. To this end, it suffices to show that gcd(y, m) = 1. However, again from 
Euler’s theorem, we get 


y= al Mtl 4 ge _ 1 =a+1—1=a(modm), 
and item (d) of Proposition 10.6 guarantees that 
ged(y, m) = ged(a,m) = 1. 


Hence, the set By4,; = By, U {y} is formed by k + 1 pairwise relatively prime 
elements of A, thus completing the induction step. oO 
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For the next example, note that if a, k € N, then the decimal representation of ak 
has exactly [k log, a] + 1 digits. Indeed, letting m denote the number of algarisms 
of a*, we have 10”—! < ak < 10” and, then, 


m—-l1< log), a* <M, 


hence, m — 1 = [logyya*| = [k log, a], as wished. 


Example 10.22 Prove that there exists a power of 2 with 1000 consecutive zeros in 
its decimal representation. 


Proof We wish to find m € N such that the decimal representation of 2” is of the 
form 


m _ 
2” = («#*...*00...0% *...*) 19. 
1000 


Alternatively, letting A and B denote the natural numbers formed by the portions of 
the decimal expansion of 2” situated respectively to the right and to the left of the 
sequence of the 1000 consecutive zeros, we want to find m € N such that 


gm = B : 19 1000+ a A, 


with the decimal representation of A having at most a digits. 
To this end, if we let k = 1000 +a and A = 2‘, we will have 


Ouse = B . 19 1000+ + A 


if and only if the decimal representation of 2* has at most k — 1000 algarisms and 
(dividing both sides of the last equality displayed above by 2") 2”-* = B.5* +1. 
In turn, this last condition will hold if and only if 2” age | (mod 5‘). 

We then have to show that it is possible to obtain k, m € N such that: (1) m > k; 
(ii) the decimal representation of 2* has at most k — 1000 algarisms; (iii) gmk — 
1 (mod 5“). Let us start by analysing conditions (i) and (iii). 

By Euler’s theorem, we have 295) = 1 (mod 5‘). Now, since g(5*) =aAe5e 1 
there must exist g € N such that 2aset 5*q +1. Letting m =k +4-5*-!, we 
have m > k and 2”—-* = 1 (mod 5). 

We are then left to showing that it is possible to find k € N such that the 
decimal representation of 2* has at most k — 1000 digits. To this end, the discussion 
preceding the statement of this example assures that the decimal representation of 
2* has exactly [k logy, 2] + 1, so that it suffices to ask that the following inequality 
holds: 


[k logy) 2) +1 < k — 1000. 
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However, since 
k — [klogj92| > k —klogyg2 = k logy 5, 


it is enough to start by taking a natural number k for which k log;g5 > 1001. This 
is surely possible. Oo 


Problems: Sect. 10.2 


1. (Slovenia) We are given several integers whose sum equals 1496. Is it possible 
that the sum of their seventh powers equals 1999? 
2. (Australia) If p is a prime number, prove that 


11...1122...22...99...9 —123456789 
—S Sr YS Kj — 
P P P 


is divisible by p. 
3. Given a sequence (%1, *2, X3,..-,;X2n—2, X2n—1, X2n) Of reals, an allowed oper- 
ation is to replace it by the sequence 


(Xn41, XI, Xn4+2> X25 +++5X2n—15 Xn—15%X2n; Xn). 


Assume that we start with the sequence (1, 2,3,..., 27 —2, 2n — 1, 2n), where 
2n + 1 is a prime number. Show that, after performing 2n allowed operations, 
all of the numbers will be back to their original positions. 

4, (BMO) Prove that the equation y* = x° — 4 has no integer solutions. 

5. (USA) Given a prime p, prove that there are infinitely many naturals n for 
which p divides 2” — n. 

6. (Romania) Prove that there does not exist an integer n > 1 such that n divides 
3" — 2". 

7. (Bulgaria—adapted) Given prime numbers p and q, do the following items: 


(a) If p | (5? — 2°), prove that p = 3. 
(b) If p => q andg | (5? — 2?), prove that g = 3. 
(c) Find all such p and g for which pq | (5? — 2?)(54 — 24). 


8. (BMO) Prove that, for every given natural number n, there exists a natural 
number m > n such that the decimal representation of 5” is obtained from 
that of 5” by adding a string of algarisms to the left of it. 

9. (IMO) Prove that, for every integer n > 1, there exist pairwise distinct integers 
ki, ko, ..., ky > 1 such that each two of the numbers 2"! — 3, 2% — 3, ..., 
2k» — 3 are relatively prime. 
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10. Let (an)n>1 be the sequence of positive integers implicitly defined by the 
equality 


> dq = 2", 


O<d|n 


which is valid for every n € N. Prove that n | a, for every n € N. 

11. (iran—adapted) Euler’s theorem guarantees that, for every integer n > 2, each 
prime factor of n is also a prime factor of 2) — 1. Nevertheless, the converse 
claim is not true in general, namely, if n > 3 is odd, then 2?) — 1 always has 
prime factors which are not prime factors of n. In order to prove this, do the 
following items: 


(a) Let p1,..., px be pairwise distinct odd primes, such that k > 1 or py ¥ 3. 
Show that there does not exist integers a1,...,a@% > 0, not all zero, such 
that 


7(Pi-).--Pr- Dy Py! coe 


(b) Conclude that if n 4 1, 3* is and odd positive integer, then there exists a 
prime p such that p + n but p | (29 — 1). 
(c) Ifn = 3, with k > 2, show that 2?” — 1 = 0(mod7). 


12. The purpose of this problem is to prove the following result, which is known as 
Sophie Germain’s theorem’: if p is a prime number such that 2p + 1 = q is 
also prime, and if x, y and z are integers for which x? + y? + z? = 0, then p 
divides at least one of x, y and z. To this end, show the items below: 


(a) We can assume that x, y and z are pairwise relatively prime and that p > 2. 

(b) Ifx? + y? +z? =0, then p| (n+ y+z). 

(c) By contradiction, assume from now on that p { x, y,z.Ifr 4 p is a prime 
divisor of y + z, then 


ye! _ yP2z Be eng sit yzP—? 47-1 = py?! (modr). 
(d) Conclude from (c) that r + (y?~! — yP-?z +... — yzP-? 4 zP7!) and, 
hence, that ged(y + z, y?-! — yP-2z 4... — yeP-2 4 PF) = 1. 
(e) Show that there exist a,b,c,d € Z such that y+ z= a?,z+x = DP, 
x + y= cP and ns = yP-2z + at a ae ee, yer? + zP-l = d?. 


(e) Since p = - we have x“? + yr + ae = 0. Now, apply Fermat’s 


little theorem to show that q | xyz. 


? After Marie-Sophie Germain, French mathematician of the eighteenth and nineteenth centuries. 
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(f) If gq | x, deduce that b? + c? — a? = O(modgq). Then, substitute 
p= a1 and use Fermat’s little theorem once again to show that g | abc. 
Subsequently: 


i. If g | b org | c, conclude that g | x, y org | x, z, thus contradicting 
item (a). Therefore, g | a = (y +z). 

ii. It follows from (c) that d? = py?—! = pc?!) (mod q). Substitute 
p= a1 and apply Fermat’s little theorem yet another time to show 


that +1 = p (mod q), thus reaching a new contradiction. 


13. (India—adapted) For each x € N, let S(x) denote the set 
S(x) ={y EN; gy) =x, for some k € N}, 


where ¢ stands for the Euler function, gy!) = g and, for each k > 1 natural, 
gy“ is the composite of y with itself, k times. Let also 


TS? 3 eZ a= iy. 
Do the items below to show that 


x €T & S(x) is infinite. 


(a) Prove that x € T > S(x) is infinite. 

(b) For each x € N, let u(x) denote the exponent of the greatest power of 2 
that divides x. Show that u(g(x)) > u(x). 

(c) Ifx ¢ T, prove that u(g(x)) = u(x) if and only if x = 2” p*, where m € N 
and p > 7 is a prime such that p = 3 (mod 4). 

(d) If x ¢ T is such that u(y (x)) = u(y(x)) = u(x), prove that the natural 

number a if item (c) is equal to 1. 

Assuming that S(x) is infinite, prove that there exist a), a2, a3,... € N 

such that 


(e 


Na 


xX =a\, a) = 9(a2), a2 = Y(a3), 43 = Y(a4),... 


(f) Continue assuming S(x) to be infinite. Use the result of item (b) to prove 
that there exists n € N for which 


u(dan) — u(An+1) = u(An+2) Sie, 


ra 


Conclude from item (d) that, with respect to the integer n of item (f), if 
i > n-+2 then there exists a prime pj; > 7 such that p; = 3 (mod4) and 


mj; € N such that aj = 2”: p;. Then, prove that myj+42 = mpn43 = --- and, 
for every i > 1, pj = 2pji-1 + 1. 


(g 


10.3 Linear Congruences and the Chinese Remainder Theorem 263 


(h) Prove that there is no infinite sequence q1, g2, q3, ... of primes such that 
gi = 2gi-1 + 1 for everyi > 1. 
(i) Conclude that S(x) infinite > x € T. 


10.3. Linear Congruences and the Chinese Remainder 
Theorem 


In this section, we study linear equations and systems of equations involving 
congruences. 
We first consider linear congruences, i.e., congruences of the form 


ax = b(modn), (10.6) 


with a, b and n being given integers, such that a 4 0 andn > 1, and we search for 
the solutions (or roots) x € Z of the equation, i.e., the integers x for which (10.6) 
holds true. 

As a particular case of the above situation, we say that a is invertible modulo n 
if the linear congruence (10.6) has a solution when b = 1, ie., if there exists x € Z 
such that 


ax = 1(modn). 


If this is so, then such an integer x is said to be an inverse of a, modulo n. In this 
respect, we have the following important 


Proposition 10.23 An integer a is invertible modulo n if and only if gcd(a, n) = 1. 
In this case, any two inverses of a, modulo n, are congruent modulo n. 


Proof If a is invertible modulo n, then there exists x € Z such that ax = 1 (modn). 
Then, there exists y € Z for which ax = ny+1 or, which is the same, xa+(—y)n = 
1. In view of such an equality, we know from Corollary 6.16 that gcd(a,n) = 1. 

Conversely, recall that Bézout’s theorem guarantees that the gcd of any two inte- 
gers can always be written as a linear combination of them. Hence, if gcd(a, n) = 1 
then there exist x, y € Z such that ax + ny = 1, and it immediately follows from 
this that ax = 1 (modn). 

Finally, let both x and y be inverses of a, modulo n. Then 


ax = 1=ay(modn), 


so that ax = ay (modn). However, since gcd(a,n) = 1, Proposition 10.6 gives 
x = y(modn). Thus, a has, modulo n, a single inverse. oO 


Corollary 10.24 Ifa, p € Z, with p prime, then a is invertible modulo p if and 
only if p { a. Moreover, in this case the inverse of a modulo p is unique. 
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Proof Immediate from the fact that gcd(a, p) = 1 pfta. Oo 


Corollary 10.25 Let a,b,n € Z, withn > 1. Ifa is invertible modulo n, then the 
linear congruence ax = b (modn) always has a single solution modulo n, namely, 


x =a?™-|b (modn). 


In particular, every inverse of a, modulo n, is congruent to a? —', modulo n. 


Proof Ifa is invertible modulo n, then gcd(a, n) = 1. Hence, successively applying 
item (f) of Proposition 10.6 and Euler’s theorem, we get 


ax = b(modn) & a?™ x = a?™—'b (modn) & x = a?™~'b (modn). 


oO 


Since any two inverses of a modulo n are always congruent modulo n, from now 
on we shall say that a has a single inverse modulo n. Such an inverse will be denoted 
by a~!, whenever there is no danger of confusion with the usual inverse + of a with 
respect to the multiplication in Q. 

As an application of the notion of inversion modulo n, we shall now prove the 
famous primality criterion known as Wilson’s theorem.* 


Theorem 10.26 (Wilson) A natural number p is prime if and only if 


(p — 1)! = —1 (mod p). 


Proof If p = mn, with m and n integers and | < n < p, then (p — 1)! 


A Ill 


—1(mod p) implies (p — 1)! = —1 (modzn). On the other hand, since 1 < n < 
p — 1, we also have that n | (p — 1)!, ie., (p — 1)! = O(modn). Therefore, 
0 = (p — 1)! = —1 (moda), and hence n | 1, which is a contradiction. 


Conversely, let p be prime and consider the function 
f :{1,2,...,p—-—1}—> {1,2,..., p— 1}, 
that associates to eacha € {1,2,..., p — 1} its inverse ate {1,2,...,p— 1}, 


modulo p. Corollary 10.25 assures that f é uma bijecdo, with f(a) =b <> f(b) = 
a. Moreover, 


3 After John Wilson, English mathematician of the seventeenth century. Although it is a primality 
test, in practice Wilson’s theorem is not so useful, for, according to Stirling’s formula (cf. [8], for 
instance), n! grows exponentially as n — +00. 
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f(a) =a a’ = 1 (mod p) 
© p| (a*—1) 
& p|(a—1) or p| (a+1) 
©a=lorp-l. 


Hence, there exist aj,...,@p-3 € {1,2,..., p — 1} such that 


p-3 


11,2,...p-=,p—-HUUffa,ar. 


i=1 


Finally, it follows from this, together with the fact that aja; ' = 1|(mod p) for 


1 <i < 2,°, that 


p-3 


“oF 
(p — 1)! = (p—1) | [@a;') = -1 (mod p). 


i=l 
oO 


Back to equations involving congruences, a natural generalization of the linear 
congruence (10.6) is obtained by considering solutions of a system of linear 
congruences. In this sense, the following result, known as the Chinese remainder 
theorem, examines the existence of solutions for such systems. 


Theorem 10.27 Let mj, m2, ..., mx be pairwise relatively prime naturals, all 
greater than |. For arbitrarily given integers aj, az, ..., ag, the system of linear 
congruences 


xX = a; (modm}) 


xX = ay (modmy?) (10.7) 


xX = ag (mod mx) 
admits a unique solution, modulo m\m2...mx. Yet in another way, there exists a 
single integer0 < y < mymz...mg such that x € Z satisfies the system above if 
and only if 
x = y(modmym2...mx). 


Proof Note first that if x; and x2 are any two solutions of the above system, then 


Xj =daj =x2(modm;), V1 <i <k. 
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However, since m1, ..., mg are pairwise relatively prime, it follows from 
Proposition 10.6 that 


xy = x2 (modmym2...mx). 


Hence, if the system (10.7) has a solution at all, this will be unique, modulo 
mim2...M . 
For the existence of a solution, for 1 < j < k let 


yj= I] mj, 


1<i<k 


atl 


so that gcd(y;, mj) = 1. Let also b; be the inverse of y; modulo m; and x = 
am ajb;y;. For a fixed 1 < / < k, we have y; = 0(modm;) whenever j 4 J; 
hence, modulo m; we get 


x= abi y, = d]!° l= al (mod m7). 


oO 


Problem 5 offers an alternative proof for the existence of solutions for the system 
of linear congruences (10.7). In order to finish this section, let us see an example 
who shows how it is possible to apply the Chinese remainder theorem to obtain 
interesting results.4 


Example 10.28 Given an integer n > 1, show that there exist n consecutive and 
composite natural numbers. 


Proof Choose n distinct primes pj, p2, ..., Pn and consider the system of linear 
congruences 


x = —1 (mod p7) 
x = —2 (mod p}) 


x = —n (mod pe) 


Since p1, p2,..., Pn are pairwise relatively prime, the Chinese remainder theorem 
assures the existence of m € N satisfying the above system. Hence, P; | (m+ J) for 
1 < j <n,sothatm+1,m+2,...,m-+n are all composite natural numbers. O 


‘Tn this respect, see also Problems 12, page 309, and 4, page 314. 
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Problems: Sect. 10.3 


1. Let a, b and n be given integers, with n > 1. With respect to the linear 
congruence ax = b (modn), prove that: 


(a) There is a solution if and only if gcd(a, n) | b. 
(b) If gcd(a,n) | b, then the linear congruence has exactly gcd(a, n) pairwise 
incongruent solutions, modulo n. 


2. Let aj, do, ..., a, b and n be given integers, with n > 1. Prove that the 
congruence 


AX, + agxg +--+ + agx, = b (modn) 


has solutions if and only if gcd(a1, a2,..., ag, n) | b. 

3. A group of soldiers was arranged as to form a rectangular array, with several 
rows. The commander observed that by placing 12 soldiers in each row, 7 soldiers 
would be left, while by placing 13 in each row, 5 would be left. Knowing that the 
total number of soldiers ranged between 600 and 700, find out how many were 
them. 

4. In the hypotheses of Theorem 10.27, do the following items: 


(a) Show that solving (10.7) for k = 2 is equivalent to solving, in u, v € Z, the 
linear diophantine equation 


mu —m2v = a2 —- a). 


(b) Show that solving (10.7) is equivalent to finding out all y € Z that solve the 
system of linear congruences 


my = a2 — a, (modm?) 
my = a3 — a (modm3) 


my = aq — a, (modm g) 


(c) Apply the procedure delineated along items (a) and (b) to find out all integer 
solutions of the system of congruences 


x =2(mod5) 
x =4(mod11). 
x = 9 (mod 13) 


5. In the notations of Theorem 10.27, if yj; = Theiz m; for 1 < j < k, prove that 
ifj 


x= ye 14j oo ’ solves the system of linear congruences (10.7). 
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6. Let p; = 2, p2 = 3, p3 = 5, ... be the sequence of primes. Show that there exists 
a natural number which leaves remainder 2?«—! — 1 upon division by 2?* — 1, for 
2<k < 100. 

7. Prove that there are arbitrarily long sequences of consecutive natural numbers 
such that none of them is a perfect power of exponent greater than 1. 

8. Let m = Py ee oe be the canonical decomposition of the integer m > 1 
in powers of distinct primes, and f be a polynomial function with integer 
coefficients. 


(a) Prove that the congruence f(x) = 0 (modm) has an integer solution if and 
only if each one of the congruences f(x) = 0 (mod Peds for 1 <i <k, has 
itself an integer solution.> 

(b) Given t € N, let M(t) denote the number of integer solutions, pairwise 
incongruent modulo t, for the congruence f(x) = O(modfr). Show that 
N(m) = N(p}')...N(p;"). 


9. (IMO shortlist) Let be given a nonconstant arithmetic progression of natural 
numbers, whose initial term and common difference are relatively prime. If it 
contains a perfect square and a perfect cube among its terms, show that it also 
contains a perfect sixth power.° 


5If p is prime and a € N, a sufficient condition for the solvability of a congruence of the form 
f(x) = 0(mod p*) will be presented in Problem 11, page 394. 

The hypotheses that the arithmetic progression is nonconstant and has relatively prime initial term 
and common ratio are actually unnecessary. They were assumed just to simplify the solution of the 
problem. 


Chapter 11 ®) 
Congruence Classes cen 


In this chapter, we return to the point of view of Example 2.21, looking at 
congruence modulo n as an equivalence relation. As a byproduct of our discussion, a 
number of interesting applications will be presented, among which is an alternative, 
simpler proof of Euler’s theorem. We will also introduce the quotient set Z, and 
show that it can be furnished with operations of addition and multiplication quite 
similar to those of Z. In particular, the case of Z,, with p prime, will be crucial to 
our future discussion of polynomials. 


11.1 Systems of Residues 


Given an integer n > 1, Proposition 10.5 assures that the relation ~ in Z, defined by 
a~b#sa=b(modn), 


is an equivalence relation, called the congruence relation modulo 7. 
For a € Z, we let the congruence class of a, modulo n, be defined as the 
equivalence class a of a with respect to the relation of congruence modulo n: 
a={xeZ; x =a(modn)} 
={xeZ; x =a+ngq, iq €Z} (11.1) 
={...,-2n+a,—-n+a,a,n+a,2n+a,...}. 


Given a € Z, we know that there exists a single integer 0 < r < n (the remainder 
of the division of a by n) such that a = r (modn). Therefore, {0,1,...,2 — 1} is 
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a system of distinct representatives for the relation of congruence modulo n, and 
Proposition 2.16 together with (2.12) furnish the partition 


Z=OU1U---Un—I (11.2) 


of Z. More generally, we have the following important 


Definition 11.1 A complete residue system modulo n, (abbreviated CRS) is a 
system of distinct representatives for the congruence relation modulo n. 


It follows from (11.2) and (11.1) that every CRS modulo n is a set {ag, a1, 
...,@,—1} of integers, such that a, = r (modn) for 0 < r < n — 1. Alternatively, 
since 0 = 7, we can view a CRS modulo na set {aj, a2,..., dy} of integers, such 
that a, = r (modn) for | < r <n. Yet in another way, a set of n integers is a CRS 
modulo n if and only if its elements are pairwise incongruent modulo n. 


Example 11.2 Given anintegern > 1, the sets {0, 1, 2,...,n—1} and {1,2,...,”} 
are obviously CRS modulo n. Another example is furnished by the set 


[x eZ; —S<x<s}. 
2, 2 


Indeed, note first that if —5 <x < y < 5, thenx # y(modn), for0 < y—x <n. 


Now, it suffices to separately consider the cases n even and n odd, in order to show 
that the set displayed above has exactly n elements. 


The coming proposition provides a useful way of constructing new CRS out of 
given ones. 


Proposition 11.3 


(a) Let a, b and n be given integers, with n > | and a and n relatively prime. 
If the set {xo, X1,..-,Xn—1} is a CRS modulo n, then the set {axo + b, ax, + 
b,...,aXy—1 +b} is also a CRS modulo n. 

(b) Let m and n be integers greater than | and relatively prime. If the sets 
{x0,X1,---,;Xm—1} and {yo, y1,.--, Yn—1} are CRS modulo m and n, respec- 
tively, then the set 


{nx; + myj; O<i<m,0<j <n} 


is a CRS modulo mn. 
Proof 


(a) Since every CRS modulo n has n elements, it suffices to show that if O < i, 
j <nand 


ax; + b= ax; +b(modn), 
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then i = j. To this end, it follows from the above congruence that axj = 
ax; (modn). Now, since gcd(a, n) = 1, item (f) of Proposition 10.6 guarantees 
that x; = x; (modn). Finally, since {xo, x1, ...,X»—1} is a CRS modulo n, this 


last congruence gives us i = j. 
Analogously to the proof of item (a), it suffices to show that if 0 < i, 7 < m, 
0<k,l <nand 


(b 


Ye 


nx; + Myz = nx; + my, (modmn), 


then i = j and k =/. Assuming the validity of the above congruence, we have 
from item (g) of Proposition 10.6 that 


nx; + my = nx; + my (modm) and nx; + myx = nx; + my; (modn). 
In turn, the last two congruences above readily give 
nx; =nxj;(modm) and myg = my; (modn). 


However, since m and n are relatively prime, by applying again item (f) of 
Proposition 10.6 we obtain 


Xj =x; (modm) and yx = yj (modn). 
Finally, since {xo, X1,.--,Xm—1} and {yo, y1,---, Yn—1} are CRS modulo m and 
n, respectively, we geti = j andk = 1. oO 


The previous proposition furnishes a proof of a particular case of the chinese 
remainder theorem which sheds light in the proof of the general case presented in 
the last section. In this respect, see also Problems 2 and 3. 


Example 11.4 Let m, and m2 be relatively prime naturals greater than 1, and a, and 
az be arbitrarily given integers. We will show that the system of linear congruences 


me (mod m1) (11.3) 


X = ay (modm?) 


has a single solution, modulo m m2. 
To this end, item (b) of the previous proposition assures that 


{m uy +mu2; x; € Z and 1 <u; < m; fori = 1, 2} 
isa CRS modulo m,mp. If we set x = mju, + my2u2, for some such uw, and u2, then 


xX =m2u2 (modmy) and x = m,u, (modm?). 
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Therefore, by the transitivity of congruences, x will solve (11.3) if and only if 
we can choose 1 < uy < my, and | < u2 < mz such that 


m2u2 = a, (modmy) and miu, = a2 (modmy). 


Although Corollary 10.25 assures we can uniquely choose such uw; and uz, this 
is also an immediate consequence of item (a) of the previous proposition. Indeed, 
since gcd(m 1, m2) = 1, that item guarantees that 


{m2x2; 1 < x2 < m4} 


is a CRS modulo mj; hence, there is a single 1 < u2 < my, such that m2u2 = 
a, (mod m,). By the same token, there is a single 1 < uw; < mz such that mju, = 
a2 (mod mz). 


The coming example brings a beautiful combinatorial application of the concept 
of complete residue system, which appeared as the last problem of the 1995 IMO.! 
For what is to come, recall that for Z Cc R andr € R we write Z + r to denote the 
setZ+r={zt+r; ze Z}. 


Example 11.5 (IMO) Let p be an odd prime. Compute the number of subsets of p 
elements of the set {1,2,...,2p} such that the sum of the elements of each such 
subset is divisible by p. 


Solution Let F be the family of the ie ) — 2 subsets of p elements of the set 
{1,2,...,2p}, different from X = {1,2,..., ppandY ={p+1,p+2,...,2p} 
(notice that these two sets have sums of elements equal to multiples of p). 

Let ~ be the relation defined in F by setting A ~ B if and only if the following 
conditions are satisfied: 


(i) There exists 0 <r < p—1suchthat (AN X)+r= BMX (mod p). 
Gi) ANY=BNY. 


(Here, according to the paragraph that immediately precedes Example 2.22, for 
ré€RandZ CRweletZ+r={z+r; z€ Z}.) 

It is immediate to verify that ~ is an equivalence relation in F. On the other 
hand, given A € F and B € A (the equivalence class of A in F), we have 


B=(BNX)U(BNY) =[(ANX) +r] U(ANY). 


Therefore, the equivalence class A will have as many sets as its elements as there 
are distinct sets of the form (AN X) +r, whenr € Z varies from 0 to p — 1. 


'For another approach to this example, see Example 20.7. 


11.1 Systems of Residues 273 


In order to address this counting problem, let A’ = AN X and S’ = Yo. ey) x. 
Since A #4 X, Y, we have 6 # A’ # X, so that 1 < |A’| < p — 1. Hence, 


Yo x= Yo xtrlA=S' +r/A', 


xeA/+r xeA’ 


On the other hand, since ged(|A’|, p) = 1, item (a) of Proposition 11.3 guarantees 
that, as r € Z varies from 0 to p — 1, the numbers S’ + r|A’| thus obtained form 
a CRS modulo p. In particular, the sums )° <4’ 4, are pairwise distinct, so that 
the sets A’, A’ + 1,..., A’ + (p — 1) are themselves pairwise distinct. Therefore, 
the class A has exactly p sets, and it follows from Proposition 2.18 that there are 


exactly 
1 2 
((z)-?) 
Dp 2 

distinct equivalence classes. 


If we now show that in each of the classes above there is exactly one set whose 
sum of elements is a multiple of p, it will follow that the number of sets we wish to 
count is precisely 


(Recall that we cannot forget X and Y!) 

For what is left to do, fix A € F (and, hence, the class A) and let A” = ANY. 
We wish to count how many are the integers 0 < r < p—1 such that the sum of the 
p elements of the set 


B=[(ANX)+r]U(ANY) = (A +r)U A" 


is a multiple of p (remember that the set B above is a generic element of the class 
A). However, for such a B, it follows from what we did above that 


a Yi xt+)ox 


xeB xeA/+r xeA” 
=Doxtrla'l+ Dox 
xeA’ xeA” 


=) xtrlA|=S+rlA’, 


xeEA 
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where S = )° <4 x. Thus, such a sum will be congruent to 0 modulo p if and only 
if 


|A’|r = —S (mod p). 


But since gcd(|A’|, p) = 1, we know from Corollary 10.24 (or from item (a) of 
Proposition 11.3, with b = 0 and |A’| in place of a) that there exists a single 0 < 
r < p —1 such that the congruence displayed above is satisfied. Oo 


Back to the development of the theory, given g,r € Z, withO <r < a, recall 
that gcd(ng + 7,n) = gcd(r,n). Thus, we can define the gcd between n and a 
congruence class modulo n by letting 


gcd(r,n) = gced(x,n), for x €7; (11.4) 
in particular, 
gced(7,n) = gced(r, n). 


Note that, by Proposition 10.23, the congruence classes r for which ged(r, n) = 
1 are precisely those formed by the integers invertible modulo n. Therefore, we have 
the following 


Definition 11.6 A reduced residue system (we abbreviate RRS) modulo n is a set 
I of integers such that 


nF = 1, if ged(r, n) =1 
0, if gcd(r,n) A 1 


for every congruence class 7, modulo n. 


For a fixed integer n > 1, the most important example of RRS modulo n is the 
set 


{x € Z; ged(x,n) = 1 and 1 < x <n}. 


More generally, if J is a RRS modulo 2, then (11.4) easily assures that |7| = y(n), 
where ¢ stands for the Euler function. On the other hand, it is clear that every RRS 
I modulo n can be enlarged to a CRS modulo n, as well as that every CRS modulo 
n contains a RRS modulo n. 

The coming result teaches us how to construct new RRS out of known ones. 


Proposition 11.7 Let m,n > 1 be given integers. 


(a) Ifan integer a is relatively prime withn and {x,, x2, ..., Xgin)} is aRRS modulo 
n, then {ax|, aX2,..., AXgin)} is also a RRS modulo n. 
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(b) If{x1, X2,..., Xg(m)} and {y1, y2,---, Yoin)} are RRS modulo m and modulo n, 
respectively, then 


{nxj +myj; l<isg(m), l<sj<¢)} (11.5) 


is a RRS modulo mn. 
Proof 


(a) Since every RRS modulo n is part of a CRS, it follows from item (a) of 
Proposition 11.3 that 


ax; #ax;(modn), VI<i<j<g(n). 


It now suffices to show that gcd(ax;,) = 1, which follows immediately from 
item (c) of Proposition 6.22. 

(b) By invoking again the fact that every RRS modulo n is part of a CRS, item (b) of 
Proposition 11.3 assures that the set in (11.5) has exactly g(m)g(n) elements, 
which are pairwise incongruent modulo mn. However, since g(m)g(n) = 
g(mn) (recall that gcd(m,n) = 1), we conclude that such a set has g(mn) 
elements. Hence, in order to show that it is indeed a RRS modulo mn, we are 
left to showing that 


gcd(nx; +my;,mn) = 1 


for all 1 < i < g(m), 1 < j < gm). To this end, items (b) and (c) of 
Proposition 6.22 give 


gcd(nx; + my;,m) = gcd(nx;,m) = gcd(x;,m) = 1 
and, analogously, gcd(nx; + my;,n) = 1; therefore, by applying one further 
time item (c) of that result, we obtain ged(nx; + my;,mn) = 1. oO 


With the concept of RRS and item (a) of the former proposition at our disposal, 
we can give another, conceptually clearer, proof of Euler’s theorem. 


Theorem 11.8 (Euler) [fa and n are relatively prime integers, with n > 1, then 
a? = 1 (modn). (11.6) 
Proof Let k = g(n) and {x1, x2,...,xx} be a RRS modulo n. Item (a) of the 
previous proposition assures that {ax1,ax2,...,axx} is also a RRS modulo n; 
hence, modulo n the integers ax;, ax2, ..., ax, are, in some order, congruent to 


X1,X2,..., Xx. Termwise multiplication of the corresponding congruences gives 


ak x1Xx2...Xk = AX, + ax2...+ AX, =X, xX2...xX~ (modn). 
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Finally, since gcd(x1x2...xx,) = 1, we can invoke item (f) of Proposition 10.6 to 
cancel x;x2...x, out of the congruence above to obtain ak=1 (mod n). oO 


Problems: Sect. 11.1 


1. Let n > 1 be an integer and {a1, a2,..., a,} be a CRS modulo n. Show that, 
modulo n, we have 


0, if n is odd 
a+dg+-:::-+an=)j)y . . . 
3 if n is even 
The following problem generalizes item (b) of Proposition 11.3. The subse- 
quent one generalizes the argument of Example 11.4. 
2. Let m,, m2, ..., mz be pairwise relatively prime naturals, all greater than 1, and 
yj = [freee mj for 1 < j < k. Show that 
iAj 


{uiy] +ugm2 +---+ugmyg; 1 < uj < m; forl <i <k} 


isa CRS modulo mym2...m x. 
3. Use the result of the previous problem to prove the general version of the chinese 
remainder theorem. 


11.2 The Quotient Set Z,, 


Let us now examine the quotient set 
Zn = {0,1,...,n — 1}, (11.7) 


of Z with respect to the relation of congruence modulo n. Proposition 10.6 allows 
us to introduce in Z, two operations, to which we shall also refer to as addition and 
multiplication and which share properties analogous to those of the usual operations 
of addition and multiplication of integers. Actually, we have the following important 
result. 


Proposition 11.9 In Z,, the operations 


@@b=a+b andaOb=a-b, (11.8) 
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are well defined operations, commutative and associative. Moreover, © is distribu- 
tive with respect to ®, and 0 and | are identities (also called neutral elements) for 
® and ©, respectively, in the sense that 


a@0=a and aoOl=a, 


for every a € Zn. 


Proof Firstly, we have to show that the operations given by (11.8) are indeed 
well defined, in the sense that a @ b and a © b are independent of the chosen 
representatives for the congruence classes @ and b. 

To this end, if @ = ¢ and b = d, then a = c (modn) and b = d (modn), and it 
follows from Proposition 10.6 that 


a+b=c+d(modn) and a-b=c-d(modn). 


However, this is the same as 


a+b=c+d anda-b=c.-d, 
so that 
a@b=cOd and 4Ob=CcOd. 


What is left to do is simpler. For instance, for the commutativity of @, we get 
from the commutativity of the addition of integers that 


a@®@b=a+b=b+a=b)@a4; 


analogously, we prove that © is commutative and that @ and © are associative, i.e., 
that 


20 (b®t) = (0b) Oe 


and 


Z2OQ(bOD) =(40b) OF. 


The checking of the distributivity of © with respect to @ will be left as an exercise 
to the reader (cf. Problem 2). Finally, 


a@0=a+0=4 and a@Ol=a-l=a, 


as we wished to show. oO 


278 11 Congruence Classes 


For the sake of exemplifying, we show below the addition and multiplication 
tables of Ze: 


Addition Table for Z¢ 
® 0 1 2 3 4 5 
0 0 1 2 3 4 5 
1 1 2 3 4 5 0 
2 2 3 4 5 0 1 
3 3 4 5 0) 1 2 
4 4 5 0 1 2 3 
5 5 0 1 2 3 4 

Multiplication Table for Z¢ 
‘S) 0 1 2 3 4 5 
0 0 0 0 0 0 0 
1 0) 1 2 3 4 5 
2 0 2 4 0 2 4 
3 0 3 0 3 0 3 
4 0 4 2 0 4 2 
5 0 5 4 3 2 1 


Whenever there is no danger of confusion, we shall write simply + and ., 
instead of @ and ©, to denote the operations of addition and multiplication in Z,,, 
respectively. Proceeding this way, we call the reader’s attention to the fact that, 
whenever we write 


a+b=a+b anda-b=a-b, 


each of the + and - signs stand for two different operations. For instance, the first 

+ sign denotes the addition operation in Z,, whereas the second + sign denotes the 

usual addition of integers. Moreover, an analogous remark holds for both - signs. 
The associativity of the addition of Z, immediately gives the cancellation law 


a+b=a+t>bd=¢. 
Indeed, if@ +b =a +4, then 


=—a+(@+b)=-a+ @G@+0) 
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and, hence, 
(—a +4) +b= (—a+@) +¢. 


Now, since —a + @ = —a +a = 0 and Ois the neutral element for the addition of 
Zn, the equality above gives b = €. 

A cancellation law for the multiplication of Z, is somewhat more complicated. 
In order to examine it, we first have the following 


Definition 11.10 An element a € Z,, is a unit if there exists b € Z, such that 
a-b=1. 


As it happens within the ordinary number system of integers, the classes +1 are 
units in Z,, for +1 -+1 = 1. On the other hand, the news is that there might be 
other units; for instance, in Zo the classes 4 and 7 are units, since 


4-7=4-7=1=7.-4. 


Ifa € Z, is a unit, then we do have the following cancellation law with respect 
to multiplication: 


a@-¢=a-d>t=d. 


Indeed, taking b € Z, such that a-b = 1 (and, hence, also b - a = 1), it follows 
from @-¢ = a-d that 


b-@-0)=b-(G-d). 
Then, the associativity of multiplication furnishes 
(b-a@)-€=(b-a)-d 


or, which is the same, I -¢ = 1 - d. This last equality amounts to ¢ = d, as wished. 
In particular, if a € Z, is a unit, then the element b € Z, whose existence is 
guaranteed by Definition 11.10 is unique, for if 


then the cancellation law for multiplication gives b = ¢. From now on, we shall say 
that such a b € Z, is the multiplicative inverse of a in Z,,. 
The coming proposition characterizes all of the units of Z,,. 


Proposition 11.11 A congruence class G4 € Z, is a unit in Z, if and only if 
gcd(a,n) = 1. In particular, Z;, has exactly y(n) distinct units. 
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Proof By definition, 7 € Z, is a unit if and only if there exists b € Z, such that 
a-b=1,1.e., ab = 1 or, which is the same, 


ab = |(modn). 


Hence, a is a unit in Z, if and only if a € Z is invertible modulo n. However, by 
Proposition 10.23, this is the same as asking that gcd(a, n) = 1. 


For what is left to do, it suffices to note that, since Z, = {1,2,..., n}, the set 
of the units of Z, coincides with the set of the classes @ such that 1 < a < n and 
gcd(a,n) = 1. Since such a set has y(n) elements, we are done. oO 


We have now arrived at the most important case. 
Corollary 11.12 Jf p is prime, then every element of Zp \ {0} is a unit of Zp. 


Proof By the previous proposition, it suffices to show that, if @ 4 0 in Z p» then 
gcd(a, p) = 1. However, a # 0 is the same as a # 0 (mod p), or p { a; moreover, 
the proof of Lemma 6.41 assures that p { a if and only if gcd(a, p) = 1. Oo 


As far as the arithmetic operations of addition, subtraction, multiplication and 
division are concerned, the previous corollary allows us to put Z,, with p prime, on 
an equal footing with respect to Q and R (and, after Chap. 13, also with respect to 
the set C of complex numbers). All of these number systems are examples of what 
algebraists call fields. 

More precisely, in Z, (for every n > 1, not only n prime) we can define an 
operation —, called subtraction, which is analogous to the ordinary subtraction of 
real numbers. We do so by setting (cf. Problem 3) 


a—b=a-—b. 
We now restrict to the case in which n = p, a prime number. Writing @~! to denote 
the multiplicative inverse of a € Zp \ {O} (ie., letting a! denote the congruence 


class b € Z p whose existence is assured by Definition 11.10), we have 


a-ai=1; 


moreover, if a-b = ¢, witha,b,c € Z p and b Hx 0, then it is immediate to verify 
that 
—~ _ sl 
a=c-b . 


Hence, we can introduce in Z p anotion of division which is also similar to the usual 
division of rational (or real, or complex) numbers, by letting 


fora € Zp andbe Zp \ {0}. 
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Problems: Sect. 11.2 


RW 


. Write the addition and multiplication tables of Zs5 and Z7. 
. * Prove that, in Zp, the multiplication is distributive with respect to addition. 


More precisely, show that 


a:-(b+¢) =(@-b)+ (4-0), 


for every a, b,C € Zp. 


. * Prove that the operation of subtraction in Z, is well defined. 
. Show that, in Z19, all units are equal to their multiplicative inverses. 
. For integers m and n, with n > 1, let the operation of multiplication by m in 


Zy, be given by m- a = m-a. Show that it is well defined and such that 


m:-(a+b)=m-a+m-b and (mj +m2)-a=m,-a+m-G, 


for every m,m,,m2 € Zanda,b € Zp. 


. If an integer n > | is not prime, show that Z, has zero divisors, i.e., that there 


exist 7, b € Zn \ {0} such that a - b = 0. Also, show that if p is a prime number, 
then Z, does not possess zero divisors. 


. Let a and n be given integers, with n > 1. In Z,,, show that the equation a@-x = b 


has a solution for every b € Z, if and only if @ is a unit in Z,. In this case, show 
that the solution x € Zn is unique, being given by x =a! -b. 


Chapter 12 ®) 
Primitive Roots and Quadratic Residues Sack a 


In this last chapter devoted to Number Theory, we return to the analysis of the 
congruence 


a‘ = 1(modn), (12.1) 


concentrating ourselves in two distinct problems, briefly discussed below. 

As a first problem, we know from Euler’s theorem that, if gcd(a,n) = 1, 
then (12.1) is always satisfied for k = g(n), where ¢ stands for the Euler function. 
However, for a fixed a € Z relatively prime to n, we do not know if such a natural 
value of k is the least possible one. In this respect, one of the core results of this 
chapter is the characterization of the integers n > 1| for which there exists some 
integer a, relatively prime with n and such that the least possible natural exponent k 
in (12.1) is k = p(n); such an a will be called a primitive root modulo n. 

A second problem we want to address here requires an important change of point 
of view towards (12.1). Instead of fixing the basis a, prime with n, and looking for 
the values of k € N for which that congruence has a solution, we fix k € N and look 
for the integer value of a which solve it. Note that this amounts to finding the roots 
of the equation x* = 1 in Z,. However, due to the elementary character of these 
notes, we restrict ourselves to the case k = 2, when the desired roots a € Z will be 
the quadratic residues modulo n. 

Along our way through the analysis of the two problems above, several inter- 
esting applications will be presented. Among them, we would like to highlight yet 
another famous theorem of P. de Fermat, characterizing the natural numbers that can 
be written as the sum of the squares of two integers. 
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12.1 The Concept of Order Modulo n 


Let a and n be integers such that n > 1 and gcd(a,n) = 1. We know from 
Euler’s Theorem 10.19 that there always exists a (at least one) natural number k 
for which a* = 1 (modn), namely, k = g(n). However, nothing we have done so 
far guarantees that (7) is the least such k; for instance 23 = 1 (mod7), even though 
y(7) = 6. These considerations motivate the following 


Definition 12.1 Given relatively prime integers a and n, withn > 1, the order of 
a, modulo n, denoted ord, (a), is the least possible h € N for which 


a" = 1(modn). 
We obviously have 
ord, (a) < p(n). 


On the other hand, the coming result establishes the most elementary properties of 
the concept of order modulo n. 


Proposition 12.2 Leta,n € Z, withn > 1 and gcd(a,n) = 1. 


(a) If ord, (a) = h, then the integers 1, a, a’, ..., a'—| are pairwise incongruent, 
modulo n. In particular, if ord,(a) = y(n), then the set {1,a,a’,...,a?™—} 
is a RRS modulo n. 


(b) ak = 1(modn) © ord, (a) | k. In particular, ord, (a) | p(n). 
Proof 


(a) If there existed 0 < k < 1 < A such that a! = a*(modn), item (f) of 
Proposition 10.6 would give us a!~* = 1(modn), with O < 1—k < h. But 
this would contradict the minimality of h. 

For what is left to do, it suffices to see that, if ord,(a) = y(n), then the set 
{1,a,a’,...,a?—'} has y(n) integers relatively prime with n and pairwise 
incongruent modulo n. Thus, such a set is a RRS modulo n. 


(b) Let ord, (a) = h, so that, in particular, ail (mod n). If k = hl, then 
aé = q"™ = a") = 1! = 1(modn). 
Conversely, let k be a natural number satisfying a* = 1 (modn). The division 
algorithm assures the existence of integers g and r, with O < r < h, such that 


k =qh-+r. Thus, we have 


1 =ak =a" — (q")4 «a = 14 -a" =a’ (modn). 
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Now, if r > 0, then r would be a natural exponent less than 4 and such that a” = 
1 (mod zn); in turn, this would contradict the minimality of h. Therefore, r = 0 and 
hence h | k, i.e., ord, (a) | k. 

Finally, ord,(a) | g(n) follows from the first part of item (b), together with 
Euler’s Theorem 10.19. oO 


Example 12.3 Compute the orders of 2 modulo 17 and of 7 modulo 10. 


Solution If h = ord,7(2), then h | g(17) = 16, so that h = 1, 2, 4, 8 or 16. It is 
sane that h # 1, 2; also, 24 = —1 (mod 17), and hence h ¥ 4. On the other hand, 
= (2+)? = (—1)* = 1 (mod 17), so that h = 8. 
If h = ordjo0(7), then h | g(10) = 4, so that h = 1, 2 or 4. Obviously, h 1; 
however, since 7” = —1 (mod 10), we also have h A 2. Thus, h = 4. oO 


Let us now see how Proposition 12.2 can be applied to an interesting problem. 


Example 12.4 If p is an odd prime, prove that all prime factors of 2? — 1 are of the 
form 2kp + 1, for some k € N. 


Proof If q is a prime factor of 2? — 1, then g is odd and 2? = 1 (mod q). It follows 
from item (b) of the previous proposition that ord, (2) | p and, hence, ord, (2) = 1 or 
p. However, if ord, (2) = 1, then g = 1, which is an absurd; therefore, ord, (2) = p. 

On the other hand, it follows from Fermat’s little theorem that 27—! = 1 (mod q). 
Then, again by item (b) of the previous proposition, we have 


p = ord,(2) | (g — 1). 


However, since g is odd, there must exist k € N such that g — 1 = 2kp. Oo 


The coming result establishes other useful properties of the concept of order 
modulo n, which will be of paramount importance for further developments. 


Proposition 12.5 Let a and n be relatively prime integers, withn > 1. 


(a) ord, (a) = ord, (a +n). 
(b) Ifm > 1 is anatural number such that m | n, ne ord, (a) | ord, (a). 
(c) If ord,(a) = h andk €N, then ord, (a*) = 


aaah ecd(i,k)* 

(d) Ifk €N, then ord, (a*) = ord, (a) & gced(ord, (a), k) = 1. 

(e) If ord,(a) = h, then the set {a, a, ..., a"} has exactly p(h) elements of order 
h modulo n. 

Proof 


(a) It follows from a = a +n (modn) that a = (a+n)‘ (modn), for everyk EN. 
In particular, a‘ = 1 (modn) if and only if (a + n)k = | (modn), and hence a 
and a + n have equal orders modulo n. 
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(b) Since m | n, item (g) of Proposition 10.6 yields that if ak¥ =1 (mod n), then 


a® =1 (mod m). In particular, since a‘ = 1(modn) when k = ord, (a), we 


have 
an = | (modm). 


Item (b) of Proposition 12.2 now furnishes ord, (a) | ord, (a). 


(c) Let d = gcd(h, k). By item (b) of Proposition 12.2, we have 


(a) =1(modn) & a =1(modn) Sh | kj 


bil ig 
1 ald 


Qs 


>? i | 
d 
where in the last equivalence we have applied item (a) of Proposition 6.22, 


together with the fact that gcd (4, 9) = |. From this, we immediately get 


h h 


io3S S 
Sree = a aed 


(d) This obviously follows from (c). 


(e) 


By item (d), the number of exponents | < k < A such that a* has order h 
modulo n equals the number of exponents 1 < k < h which are relatively 
prime with h. Therefore, it is equal to g(h). Oo 


Problems: Sect. 12.1 


Noe 


. Compute ord7(2), ord; (2) and ord)5(7). 
. Prove that, for every positive integer n, the number 23" + 1 is nota multiple of 


17. 


. Let a and n be relatively prime integers, with n > 2. If there exists a natural k 


such that a* = —1(modn), prove that ord, (a) is even. 


. (Putnam) Find all n € N for which n | (2” — 1). 
. (Turkey) For each n € N, prove that n! divides the product 


n—-1 


[ [e” a2), 


j=0 


. Given a natural n > 2, we label the vertices of a regular 2n-gon P as 1, 2, 3, 


...,n, —n, —(n — 1),..., —3, —2, —1, successively and in the clockwise sense. 
Then, we choose vertices of P in the following way: at the first round, we choose 
vertex 1; on the other hand, if vertex k; was chosen at the i-th round, then at the 
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(i + 1)-th round we choose the vertex situated |k;| vertices away from vertex k;, 
in the clockwise sense if k; > O and in the counterclockwise sense if kj < 0. 
Such a procedure continues until we choose a vertex which was already chosen 
in a former round. At the end of this process, let f(m) be the number of non 
chosen vertices. 


(a) If f(n) = 0, prove that 2 + | is an odd prime. 
(b) Compute f (1997). 


7. (IMO—adapted) Given a prime number p, we want to show that there exists 
a prime number g such that g { (n? — p), for all n € N. To this end, do the 
following items!: 


(a) Show that it suffices to find a prime g for which ord,(p) = p andq # 
1 (mod p). 

(b) Letting a = ah show that a has at least one prime factor g for which 
q # 1 (mod p”), and that ord, (p) = p. 


12.2 Primitive Roots 


Given integers a and n such that n > 1 and gcd(a,n) = 1, in this section we will 
be particularly interested in the case in which ord, (a) = y(n). Such a case is so 
important that is worth the following 


Definition 12.6 Let a and n be integers such that n > 1 and gcd(a, n) = 1. We say 
that a is a primitive root modulo n if ord, (a) = y(n). 


Let us take a look at a couple of examples. 
Examples 12.7 


(a) Since 2! = 2(mod3) and g(3) = 2, it follows that ord3(2) = 2 = (3), 
ie., 2 is a primitive root modulo 3. Similar computations show that 2 is also a 
primitive root modulo 5. 

(b) Modulo 7 we have 2! = 2, 2? = 4 and 2? = 1. Hence, ord7(2) = 3 < 6 = g(7) 
and it follows that 2 is not a primitive root modulo 7. 


The main result of this section is the characterization of all integer values n > 1 
for which primitive roots modulo n do exist. More precisely, we shall show that an 
integer n > 1 has a primitive root if and only if n = 2, 4, p* or 2p*, for some 
odd prime p. We start this path by computing the number of pairwise incongruent 
primitive roots modulo n, provided such a root does exist. 


'The coming items are based on the solution of Professor Samuel B. Feitosa. 
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Proposition 12.8 Jf an integer n > 1 has a primitive root, say a, then every 
primitive root modulo n is congruent to one of the elements of the set 


{ak; 1 <k < y(n) and ged(y(n), k) = 1}. 


In particular, n has exactly p(p(n)) pairwise incongruent primitive roots. 


Proof The second part follows immediately from the first, by the very definition of 
Euler’s function: the number of exponents | < k < y(n) such that k is relatively 
prime with g(7) is exactly y(g(n)). 

For the first part, letting a be a primitive root modulo n, we have that ord, (a) = 
y(n) and A = {a,a*...,a?™} is a RRS modulo n. Hence, any primitive root 
modulo n is congruent, modulo n, to one of the elements of A, so that it suffices to 
see which elements of A have order (modulo n) equal to y(n). However, by item (d) 
of Proposition 12.5, for 1 < k < y(n) we have 


ord, (a*) = y(n) & ged(y(n), k) = 1. 
oO 


Example 12.9 Example 12.7 assures that 2 is a primitive root modulo 5. Since 
(5) = 4, the previous propositions teaches us that a set of pairwise incongruent 
primitive roots modulo 5 is 


(2k. 1<k <4 and gced(4,k) = 1} = {2, 2°}. 


Hence, the pairwise incongruent primitive roots modulo 5 are 2 and 3 (for 27 = 8 = 
3 (mod 5)). 


We now go towards characterizing the integers which do possess primitive roots, 
starting with a necessary condition. 


Theorem 12.10 Jf and integer n > 1 has primitive roots, then n = 2, 4, p* or 2p*, 
where p is an odd prime and k is a natural number. 


Proof Firstly, note that the integers n > | which differ from those at the statement 
of the theorem are either of the form n = bc, with b,c > 2 being relatively prime 
integers, or of the form n = 2k for some integer k > 2. It thus suffices to show 
that such integer values of n do not have primitive roots, and we shall do this by 
showing that every integer a relatively prime with n satisfies ord, (a) < p(n). We 
look separately at the two possibilities above: 


(i) n = bc, with b,c > 2 relatively prime integers: since gcd(b, c) = 1, we have 
y(n) = g(bc) = ¢v(b)¢(c). On the other hand, condition b, c > 2 assures (cf. 
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Problem 19, page 220) that y(b) and g(c) are both even. If a € Z is prime with 
n, then a is prime with b and c and, from Euler’s theorem, we get 


g?n/2 — (aor = 19(/? = | (modb) 
and 


gt? = (gH) ge 19 x1 mod) 


Therefore, item (h) of Proposition 10.6 gives a?™/2 = | (modn); in particular, 


st, z on) SG) 


and, hence, a is not a primitive root modulo n. 
(ii) n = 2*, for some integer k > 2: let a be an odd integer (i.e., prime with 2). If 
we show that 


a” =1(mod2), 
we will have 
ord: (a) < ak? < 2k! = g(2*) 


and a will not be a primitive root modulo 2*. For what is left to do, let us 
make induction on k: the initial case k = 3 follows from Proposition 10.9, 
since a? = 1 (mod 8) for odd a. Assume we had already proved that, for some 
integer k > 3, there exists g € N such that a = 2kq + 1. Then, 


ok-1 


9\2 
a = (a® ") = (kg +1)? = 2g? + 241g 41 
= Qe gk-1 9? + 9g) +1 = 1 (mod 2*t), 


as wished. oO 


We are left to establishing the converse of the previous theorem, namely, that all 
numbers of one of the forms 2, 4, p* and 2p*, for some odd prime p and some 
k € N, have primitive roots. It is clear that 1 is a primitive root modulo 2 and 3 is 
a primitive root modulo 4. The rest of this section is devoted to the analysis of the 
two remaining cases. 
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The coming result shows that it suffices to worry about the numbers of the 
form p*. 


Proposition 12.11 Jf p is an odd prime and a € Z is a primitive root modulo pk, 
then a or a + p* is a primitive root modulo 2p*. 


Proof Let h = ord, ,« (a). If a is odd, then a is relatively prime with 2 p*. Then, by 


using item (b) of Proposition 12.5, together with the fact that ord pk (a) = ¢( p* ), we 
obtain 


9(p*) = ord « (a) | h = ords,« (a) | p(2p") = e(p'), 


where, in the last equality above, we used the fact that p is odd. Therefore, 
ord, pk (a) = 922 p*) and a is a primitive root modulo 2 p 
If a is even, change a by a + p* from the beginning and argue as above. oO 


We shall complete the proof of the converse of Theorem 12.10 (i.e., the analysis 
of the case p*), in two steps. Nevertheless, before that we need an auxiliary result. 


Lemma 12.12 Let p be an odd prime. If a is a primitive root modulo p?, then, for 
every integer k > 1, we have avr’) = by p* + 1, for some by € Z such that p + bk. 
Proof Let us make induction on k > 1. Fermat's little theorem gives a?~! = bj p+ 


1, for some bj € Z. If p | by, we would have apts 1 (mod p’) and, hence, 
ord p? (a)< p-1< 9 Pp’); this would contradict the fact that a is a primitive root 


modulo p?. 
Assume that, for some integer k > 1, we have av(P") = by p* + 1, with by € Z 
such that p { bg. Then, Newton’s binomial formula gives 


Pp 
gee") = (art?) — d + by p*)? 


p-1 
= 14 dep th + >. (“)oin" + by pP. 
j=2 


Now, for 1 < j < p — 1 Example 6.42 assures the existence of cx, € N such that 
(7) = pcx. Therefore, the last expression above furnishes 


p=1 
aP PO) = 1 + bephth YS ceby plk*! + bp p?* 
j=2 
p-1 ; 
= 14 depth + ‘oy cyb} p—Dk-1 + be pe 2) ot 
j=2 
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Letting t denote the expression within parentheses and noticing that t € Z, we 
finally get 


qr) — 1+ by pt! + tp**? = 1+ (b + tp)p**. 
However, since p + bx, it suffices to let bey; = be + tp to obtain a??"*') = 
bey pst! + 1, with pt Best. - 


We can finally execute the first of the two steps needed to finish the proof of the 
existence of primitive roots modulo p*, for an odd prime p. 


Theorem 12.13 Let p be an odd prime and a be relatively prime with p. 


(a) Ifais a primitive root modulo p, then a or a+ p is a primitive root modulo p*. 
(b) If a is a primitive root modulo p and modulo p?, then a is a primitive root 
modulo p*, for every integer k > 1. 


Proof 


(a) It follows from item (a) of Proposition 12.5 and from our hypotheses that 
ordy(a+ p) = ord,(a) = p—1. On the other hand, item (b) of Proposition 12.2 
assures that 

ord,2(a + p), ord,2(a) | (p*) = p(p — 1), 
whereas item (b) of Proposition 12.5 gives 


p —1=ord,(a) | ord,2(a) and p—1=ord,(a + p) | ord,2(a + p). 


Hence, ord ,2 (a) = p—1or p(p-—1), the same being valid for ord ,2 (a+ p); 
thus, we are left to showing that 


ord,2(a) = p—1=> ord,2(a+ p) A p—1. 
To this end, if a?~! = 1 (mod p?), then modulo p” we have 


p-1 

—-1\., 

(a+ pp)?! =a? "4 (p—1)pa?* + 2 (’ j part 
j=2 


a?! + (p—1)pa?~* 


l= pa? * 41, 


for p fa. 
(b) Suppose that a is a primitive root modulo p and modulo p*. We shall use 
induction on k to prove that a is a primitive root modulo p*, for every k € N. 
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The cases k = 1 and k = 2 are our hypotheses. Assume, then, that we have 
already proved that a is a primitive root modulo p*, for some integer k > 2. 


Items (b) of Propositions 12.2 and 12.5 give us 


erly = 


9(p*) = ord,« (a) | ord yi+1 (a) | e(p py(p'), 


so that 


ord ,4+1 (a) = o(p*) or g(pkt}). 


On the other hand, since a is a primitive root modulo p?, the previous lemma 
furnishes a?) = by pk + 1, for some by € Z such that p { bg. In particular, 


a?) £1 (mod p**4), 


so that ord, «+1 (a) # g(p*). Oo 


Theorem 12.13 puts us in quite a nice position. Indeed, if we show that an odd 
prime p has a primitive root, say a, then a + p will also be such a root (by item (a) 
of Proposition 12.5); hence, item (a) of the previous theorem allows us to assume 
that a is a primitive root modulo p and modulo p?, and after that item (b) of that 
same result assures that a is a primitive root modulo p*, for every k € N. 

We are then left to showing that odd primes have primitive roots. Unfortunately, 
at this point such a proof is beyond the scope of the material we have at our disposal, 
and will have to wait until Sect. 19.3 (cf. Theorem 19.20) to be presented. Thus, for 
the time being we assume the existence of primitive roots modulo p, taking the 
opportunity to present some instructive examples. 


Example 12.14 Prove that 2 is a primitive root modulo 3* and modulo 5*, for every 
keN. 


Proof From Example 12.7 and the previous theorem, it suffices to show that 2 is 
a primitive root modulo 9 and modulo 25. Let us check that 2 is a primitive root 
modulo 9, the case of modulo 25 being entirely analogous: since g(9) = 6, we 
have ordg(2) | 6; however, since none of 2! 2 or Bisa multiple of 9, we get 
ordg(2) = 6 = g(9). Oo 


The next two examples show how powerful the information on the existence of 
primitive roots can be. 


Example 12.15 If p is prime and n € N, prove that 


0 (mod p), if (p — 1) }n 


P49 ai = 
meehce ese tp =I) ne eee 
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Proof If (p — 1) | n, say n = (p — Ik, then Fermat’s little theorem gives, for 
lsasp—t, 


a” = qP—Dk = 1k = | (mod p). 


Therefore, 


7427 4+.---+(p—-—1I)” =14+1+4+---+1=-—1 (mod p). 
—<—--—_—_—_—’ 
pal 


If (p — 1) { n, let a be a primitive root modulo p. Since {a, a’,..., aP-}) isa 
RRS modulo p, we conclude that the numbers a, a,...,a?} are, modulo p and 
in some order, congruent to 1, 2,..., p — 1. Hence (also modulo p), 

pn _ jn 
PO ee (pa ag a eee ——. 
qh — 


Now, by invoking Fermat’s little theorem once more, we get 


a?” — q" = (a?)" — a" =a" — a" = 0(mod p). 


On the other hand, since ord,(a) = p — 1 and (p — 1) t n, item (b) of 
Proposition 12.2 guarantees that p { (a” — 1). Therefore, p | eae a , aS we wished 
to show. oO 


Example 12.16 (IMO) Find all n € N for which n? divides 2” + 1. 


Solution It is clear that such an n must be odd and that n = | is a possible value. 
Then, let n > 1 be a natural number satisfying the given conditions and write n = 
pq, where p is the least prime divisor of n and k € N is the exponent of p in the 
canonical decomposition of n in prime powers. Then, gcd(p, g) = | and 


27 1 noe! 


N. 
2 pe 


n 


By Fermat’s little theorem, we have 2? = 2(modp), from where an easy 
induction gives 2P* =2 (mod p). Thus, modulo p we obtain 


O=2"41=2P941=2941, 


so that 227 = 1 (mod p). Letting t = ord,(2), it follows that t | 2q and (again 
by Fermat’s little theorem) ¢ | (p — 1); therefore, t | gcd(2q, p — 1). However, 
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since the prime divisors of g are greater than p, this forces us to have t = 2 and, 
consequently, p = 3. Hence, 
fee gi | 
ne: 32kg2 


N, (32) 


which successively implies 34 =-1 (mod 37*) and 
2234 = 1 (mod 3*), 


Since we have already shown (cf. Example 12.14) that 2 is a primitive root 
modulo 3", the above congruence gives 


2. 37-1 — 9(3*) = ordsx(2) | 2. 3*g, 


so that 3*—! | g. However, since p = 3 and gced(3,q) = 1, we must have k = 1. 
Therefore, n = 3g and we conclude from (12.2) that 


8741 
qe 


EeN 


By the sake of contradiction, assume that g > | and let w be the least prime 
divisor of g, say gq = w!v, where v € N and / is the exponent of w in the canonical 
decomposition of g as a product of prime powers. Once more from Fermat’s little 
theorem, we have gu! = 8 (mod w) and, hence, 


0=8741=8""41=8" 4+ 1(modw), (12.3) 
so that 87” = 1 (mod w). Letting t = ord,,(8), it follows that ¢ | 2v and (one more 
time from Fermat) t | (w — 1). However, since the prime factors of v (if they exist) 
are all greater than w, we conclude that t = 1 or 2, and this gives w | (8! — 1) 
or w | (3° — 1). In any case, it follows from w > p = 3 that w = 7, and (12.3) 
furnishes 


0=8'+1=1°+1=2(mod7), 


which is an absurd. We finally conclude that g = 1 andn = 3‘q =3. oO 


Problems: Sect. 12.2 


1. Show that 2 is a primitive root modulo 29. 
2. Let m,n € N be such that m | n, and let a € Z be a primitive root modulo n. 
Prove that a is also a primitive root modulo m. 
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3. Prove the following generalization of Wilson’s theorem: if n € N has primitive 
roots and 1 = aj < az < +--+ < dg(n) = n — | are the integers from | to n and 
relatively prime with n, then 


n | (aja2...dgiay + 1). 


4. (Vietnam) Show that, for every integer n > 1, there exists an integer k, > 1 
such that 19 — 97 is a multiple of 2”. 

5. IMO shortlist) Given k € N, prove that there are infinitely many natural 
numbers n for which n - 2 — 7 is a perfect square. 

6. (Romania) Find all prime numbers p and q, distinct from 2 and 3 and such that 
3pq | (a*?4-! — 1), for every natural number a relatively prime with 3 pq. 

7. (Brazil—adapted) Let p > 5 be a prime number for which pet is also prime, 
and let f : Z4 — R be given by f(xy) = f(x) f(y) and f(x + p) = f(r), 
for all x, y € Zy. 


(a) Show that f(0), f(1) € {0, 1} and that f is constant if f(0) = l or f(1) = 
0. 

(b) From now on, assume that f(0) = 0 and f(1) = 1. Use Fermat’s little 
theorem to conclude that f(a) € {—1, 1} for every a € N such that p { a. 

(c) If there exists a primitive root a modulo p such that f(a) = 1, then 


_ 0, if p|x 
eee 
(d) If f(a) = —1 for every primitive root a modulo p, then 


_ 40, if p|x 
f(x) = eee : (=1j0e tl (mod p), if ptx . 
8. (Turkey) Prove that the two following claims on n € N are actually equiva- 
lent: 


(a) nis square free? and, if pis one of its prime divisors n, then (p—1) | (2-1). 
(b) For every a € N, we have that n | (a” — a). 


9. (India—adapted) For n € N, let s, = 1+ -y_) k"—!. The purpose of this 
problem is to characterize all n € N such that n | s,. To this end, do the 
following items: 


(a) If p is prime and q € N, show that p | ra pe ay! 
(b) Conclude that, ifn | s,, then is square free. 


Cf. Problem 6, page 190. 
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10. 


11. 


12. 
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(c) Letn = p,... pr, with py < --- < p; being prime numbers, and qj; = 
wh 
Pi” 


i. Show that sn = 1 +i DP7'l"-! (mod pi). 
ii. Ifa is a primitive root modulo p;, show that 


qhrita-)) —q'-! 
sn = 1+ 4: | ——{——_} (mod p)). 
a —1 


iii. From ii., successively conclude that, if m | s,, then p; | (a"—! — 1), 
(pi — 1) | @— Vand (pj — 1) | (qi — ‘D. 

iv. Ifn | sy, use the results of items i. and iii. to conclude that p; | (g; — 1). 

v. Use the result of the two previous items to prove that n | s, if and only 
if pi(pi — 1) | qi — 1) forl si St. 


(Brazil—adapted) Let p be an odd prime, k 4 p, 2p be a natural number such 
that] <k < 2(p+ 1) andn=2pk+1. 


(a) If m is prime and a is a primitive root modulo n, prove that 
gcd(a* + 1,n) = 1. 

(b) Assume that there exists an integer 2 < a <n such that a’P = —1(modn) 
and gcd(a* + 1,n) = 1. 


i. If d = ord,(a), show that d | (n — 1) and d { 2k. Conclude that p | d 
and, hence, that p | g(2kp + 1). 
ii. Use the formula for g(2kp + 1) to infer that there exists an integer 
1 > 1 for which /p + | is a prime divisor of n. 
ili. Use n = 2kp + | to prove thatn = (lp + 1)(hp + 1) for some integer 
h e€ {0, 1}. Then, conclude that we cannot have h = 1, so that n is 
composite. 


Show that there exists n € N such that the last 1000 digits in the decimal 
representation of 2” are all equal to 1 or 2. 

The result of the coming problem is due to the French mathematician of 
the twentieth century Claude Chevalley, thus being known as Chevalley’s 
theorem. 

If p is an odd prime, we wish to show that, modulo p, the number of distinct 
solutions of the congruence 


xf + xg tg tg + xg = 0 (mod p) 
is a multiple of p. (Here, two solutions (aj,...,a5) and (bj,...,b5) are 


considered to be distinct if there exists 1 < i < 5 such that a; ¥ b; (mod p).) 
To this end, do the following items: 
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(a) If f(t,...,%5) = 1— ef +23 +--+ + x$)?7! and m is the number of 
distinct solutions of the given congruence, then 


m= > f(%,....%5) (mod p), 


where A = {1,2,..., p— 1}. 


(b) Modulo p, one has 
m=p-— ss Pe ere i 
ayte+a5s=p—1x1,...,.x5€A 
= oF (x eel) 
ayt--+as=p—l Sx,;eA X5EA 
(c) Ifa; +---+a5 = p—1, witha; > 0 for 1 <i < 5, then there exists 
1 <i <5 such that a; = 0 or (p — 1) { 4q;. 
(d) If (p — 1) { 4a, use a primitive root a modulo p to conclude that 
pol 4pa 4a 
4a __ 4ja_  F° 7G _ 
oD: =)a = spe = | met) 
xeA j=l 
(e) Finish the proof, showing that m = 0 (mod p). 


12.3 Quadratic Residues 


In this section, we study algebraic congruences of the form 


x =a (mod n). 


To this end, we shall need the following 


Definition 12.17 Let a,n € Z, withn > 1 and gcd(a,n) = 1. We say that a isa 
quadratic residue modulo n if the congruence 


x =a (mod n) 


has at least one integer solution x. Otherwise, a is said to be a quadratic nonresidue 
modulo n. 
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With respect to the above definition, our main task in this section will be to 
obtain necessary and sufficient conditions for an integer a to be a quadratic residue 
modulo n. In the text we shall concentrate ourselves in the case of a prime n, leaving 
the general case for the problems posed at the end of the section. 


Proposition 12.18 Let p be an odd prime. 


(a) If a is a quadratic residue modulo p, then the congruent x” = a (mod p) has 
exactly two incongruent solutions modulo p. 


(b) Ne ve integers 1, 2,..., p — 1 there are exactly >— + quadratic residues 
and °— 1 quadratic nonresidues modulo p. 

Proof 

(a) If 1% € Z are such that a = a(modp) and x = a(modp), then 


ch = xs (mod p), so that p | (ae —x3). However, since p is prime, we conclude 
that p | (x1 — x2) or p | (x1 + x2). Hence, the congruence stated in item (a) 
has at most two incongruent solutions modulo p, namely, x; and —x, (they are 
incongruent since p is odd and gced(a, p) = 1 > gcd(x1, p) = 1. On the other 
hand, letting a be a quadratic residue modulo p, we know that there exists a 
solution x9 € Z for the congruence x* = a (mod p). It immediately follows 


that —xo is also a solution of such a congruence, with —x9 ¥ xo (mod p). 


(b) Since {1, 2,..., p — 1} isa RRS modulo p, in order to count how many of its 
elements are ea residues modulo p, it suffices to compute how many of 
the numbers 17, 2”, ..., ( Be — 1)* are pairwise incongruent modulo p. To this 


end, note that if 1 <i < 4, then 


= (p —i)* (mod p); 


on the other hand, if 1 <i < j < pet then 


* % j* (mod p), 


since j? —i fae ee 
Therefore, there are wees quadratic residues modulo p and, hence, 
exactly (p — 1) ( Pe 5 +) = Pe tite nonresidues modulo p. oO 


The coming proposition is due to Euler, being known in mathematical literature 
as Euler’s criterion for quadratic residues. 


Proposition 12.19 (Euler) Jf p is an odd prime and a € Z, then a is a quadratic 
residue modulo p if and only if 


a’t =1(mod p). 
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Proof Assume first that a is a quadratic residue modulo p. Then, gcd(a, p) = | and 
there exists x9 € Z such that Xe = a (mod p). In particular, xo is also prime with p, 
and Fermat’s little theorem furnishes 


a p—1 = 
a’? = (x2)'F =x?! =1(modp). 


Conversely, suppose a?t = 1|(mod p), so that in particular gcd(a, p) = 1. 
Then, letting a be a primitive root modulo p, we have that {a, a?,...,a?—!} is 
a RRS modulo p. Therefore, there exists an integer 1 < k < p — 1 for which 


a* = a(mod p). Hence, 


ok) — ge = 1 (mod p), 


and it follows from ord,)(@) = p — 1 that (p — 1) |k (2). We conclude that k is 
i 


even, say k = 21, and letting x9 = a’ we get 


2l 


xe =a = a* =a (mod p). 


Thus, a is a quadratic residue modulo p. Oo 


Corollary 12.20 [f p is an odd prime, then an integer a, relatively prime with p, is 
a quadratic nonresidue modulo p if and only if 


p-l 


a2 =-—1(mod p). 


Proof Fermat’s little theorem gives 


pol pol p-l 
(a2 —l)\(a2 +1l)=a — 1=0(mod p). 


However, since p is prime, we get 


p-l 


p—1 
a2 —1=0(mod p) ora’ > +1=0(mod p). 


On the other hand, it follows from Euler’s criterion that a is a quadratic 
nonresidue modulo p if and only if the first of the two congruences above does 


not hold, i.e., if and only if a Py + 1 =0(mod p). oO 
Remark 12.21 It is possible to prove (cf. Problem 7, page 493) that if a € Z is not 


a perfect square, then there exist infinitely many primes p such that a is a quadratic 
nonresidue modulo p. 


In order to ease computations with quadratic residues and nonresidues, we shall 
need the following notation, introduced by A-M. Legendre. 
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Definition 12.22 For integers a and p, with p prime, we let the Legendre symbol 
(4) be given by: 


j 1, if a is a quadratic residue modulo p 
(<) = 4 —1, if a is a quadratic nonresidue modulo p . 
0, if p|a 
The convenience of Legendre’s symbol is explained by the next two results. 
Proposition 12.23 If p is an odd prime and a, b € Z, then: 
= a\_{b 
(a) a = b(mod p) => (4) = (4). 
(b) If p{ a, then (<) =1. 
(c) (4) = a'F (mod p). 
a b\ _ (ab 
(@) (3) (3) 7 ( P ) 


Proof Since the proofs of items (a) and (b) are immediate, let us start by proving 
item (c): if p | a, then 


a pal 
(<) =0=a 2 (mod p); 
p 


otherwise, Proposition 12.19 and Corollary 12.20 assure that, modulo p, 


—1 if a is a quadratic nonresidue modulo p 


eA 1 if a is a quadratic residue modulo p ( a ) 
a 


In what concerns (d), we first note that if p | ab, then p | a or p | b and, hence, 


ee) 


If p { ab, then p { a and p { b, and it follows from item (c) that 


(<) (2) = a pr = Ces = (=) (mod p). 
P/ \P P 


Therefore, p divides the difference (4) (4) _ (), which assumes an integer 


value ranging from —2 to 2. However, since p is odd, we conclude that one must 


me (8) (6) (2) = : 


The two coming examples bring interesting applications of quadratic residues. In 
both of them, we shall use the fact that if an integer n satisfies n = 3 (mod 4), then 
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n has a prime divisor p such that p = 3 (mod4). An argument for the validity of 
this (easy) fact can be found along the proof of Example 6.40. 


Example 12.24 Prove that there does not exist integers x and y for which y* = 
3 
x? +7. 


Solution By the sake of contradiction, assume that such a pair of integers does 
exist. Then x is odd, for otherwise we would have ye = x3 +7 = 3(mod 4), which 
in turn contradicts item (c) of Proposition 10.9. Now, 


ytlax4+8=( 42) - 1)? +3] (12.4) 


and, since (x — 1)? + 3 = 3(mod 4) (recall that x — 1 is even), there exists a prime 
p such that p = 3 (mod 4) and p | [(* — iy 44], 

Back to (12.4), we conclude that y? +1 = O(mod p), and —1 is a quadratic 
residue modulo p. It then follows from the definition of Legendre’s symbol, together 
with item (c) of the previous proposition, that 


is (=) = (iy = (—1)**t! = —1 (mod p). 
P 


This contradicts the fact that p is odd. Oo 


For the next example, we extend the original Fibonacci and Lucas sequences (cf. 
Problem 2, page 71) to homonymous sequences (Fy)nez and (Ln)nez, by imposing 
that the extended sequences satisfy the same recurrence relations as the original 
ones, namely, Fy = F2 = | and Fj42 = Fj41 + Fj for every j ¢ Z, and L; = 1, 
Lz = 3 and Lj42 = Lj+1 + Lj, for every integer j € Z (so that, for instance, 
Fo = 0, F_; = 1, etc, and Lp = 2, L_; = —1, etc.). This way, it is a trivial task to 
prove that F_, = (—1)""!F, and L_, = (—1)"L, for every n € N, as well as that 
the extended sequences continue to satisfy all of the properties listed in Problem 2, 
page 71, and in Problems 11-13, page 252 (of course, with indices m, n and k now 
ranging in Z, instead of in N). 


Example 12.25 Let (Ln)nez be the extended Lucas sequence. Prove? that L, is a 
perfect square if and only if n = 1 or 3. 


Proof We consider three separate cases: 
(i) n = 2m, with m € Z: item (b) of Problem 2, page 71 (with m in place of n), 


gives 


Ln = Lom = L?, +2(-1)™1. 


3The result of this example and those of Problems 6 and 7, as well as their proofs, are due Cohn 


[10]. 
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Therefore, L,, differs from a perfect square by —2 or 2, so that it cannot be a 
perfect square itself. 

(ii) n = 1 (mod 4): ifn = 1, then L, = L, = 17. Otherwise, writen = 1+2-3!-k, 
with / € Zs andk € Z such that 2 | k but 3 { k. Item (d) of Problem 11, 
page 252 (with k in place of n) guarantees that L; = 3 (mod 4). Therefore, we 
can choose a prime p = 3 (mod4) such that p | Lx. On the other hand, by 
applying item (d) of Problem 13, page 253, with t = 3’, we get 


Ly, = —L, = —1(modLx), 


so that L, = —1 (mod p). Finally, using the fact (already established before) 
that —1 is not a quadratic residue modulo p, we conclude that L, cannot be a 
perfect square. 

(iii) n = 3 (mod 4): if n = 3, then L3 = 27, a perfect square. If n = 3 (mod 4) and 
n # 3, writen = 34+2-3!-k, with] € Z, andk € Z such that 2 | k but 3k. 
Arguing exactly as above, we get 


L, = —L3 = —4 (mod p), 


for some prime divisor p of Lx such that p = 3(mod4). This time, 
Proposition 12.23, together with the fact that —1 is not a quadratic residue 


modulo p, give 
CaCl 
= — 1, 
P P P 


and —4 is not a quadratic residue modulo p either. Hence, also as above, Ly, is 
not a perfect square. oO 


Back to the development of the theory, the next result, due to K. F. Gauss* and 
known as Gauss’ lemma, furnishes a much simpler procedure than that provided 
by Euler’s criterion to decide whether a certain integer is a quadratic residue or 
nonresidue modulo p, with p being a given odd prime. 


Proposition 12.26 (Gauss) /f p is an odd prime anda € Z is prime with p, then 


(4 = (—1)”, where m is the number of elements of the set 


=i = 
fisis®5 : ja=-l, 2... or = (PE) (moa py} (12.5) 


4 After Johann Carl Friedrich Gauss, German mathematician of the eighteenth and nineteenth 
centuries. Gauss is generally considered to be the greatest mathematician of all times. In the 
several different areas of Mathematics and Physics in which he worked, like Algebra, Analysis, 
Differential Geometry, Electromagnetism and Number Theory, there are always very important 
and deep results or methods that bear his name. We refer the reader to [38] for an interesting 
biography of Gauss. 
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Proof For 1 < j < _ let 2 <= po be the only integer such that 


ja = t; (mod p). 
Let us first prove that 
1<i<js?S simile. 
To this end, it suffices to see that, since gcd(a, p) = 1, we have 


It;| = |tj| => |ta| = |ja| (mod p) = |t| = |j| (mod p) 


=> i+ j =0(mod p), 


which is impossible. 
It now follows from what we did above that the numbers |f;|, ..., |f»-1 | form, 
= 


in some order, a permutation of 1, 2,..., pt (note that none of them equals 0, for 
gcd(a, p) = 1). Therefore, recalling that m denotes the quantity of indices j for 
which t; < 0, we get modulo p that 


p—1 -—1 -1 
(ita (2 i Cy a ea Saeco 
2 Z 2 2 


Hence, 
p—1 
a? =(-1)" (mod p), 


and the rest follows from item (c) of Proposition 12.23. oO 


As an application of Gauss’ lemma, we next show that 2 is a quadratic residue 
modulo p (an odd prime) if and only if p = 0, 1, 2 or 7 (mod 8). 


Example 12.27 If p is an odd prime, then 


(<) =n = en, 
P 


Proof By Gauss’ lemma, we have (3) = (—1)”, where 


m=#{lsk= 
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Ifl<k <|2"], then2 < 2k < 2|27'| < 2 and, hence, 


2k # —1,-2,..., (2+) «moa pp. 


If ety tisk < 2, then 


27 
pol pol paul 
—1>2k >2{ | —— 1 08 a | 7-2 
a (id ea eral 
in particular, 2k #1,2,..., P=} (mod P) or, which is the same, 


p-1l 
2k =—-—1,-2,... or — os (mod p). 


Hence, 


p-1l p-1 p-1l p-1 pt+l 
se ra) oa ntl 
where in the last equality above we separately considered the cases p = 4k + | and 


As for the second equality, it is enough to show that 


1 aaa 
[Pe | a P (mod 2). 
4 8 


In turn, to this end it suffices to look at the cases p = 8k+1, p = 8k+3, p = 8k+5 
and p = 8k +7. oO 


Continuing our study of quadratic residues, we are going to prove one of the 
most famous theorems of Gauss, which establishes a simple relationship between 
the Legendre symbols (2) and (4), whenever p and gq are distinct odd primes. Let 
us start by rewriting Gauss’ lemma in a suitable way. 


Lemma 12.28 [f p is an odd prime and a € Z is relatively prime with p and also 
odd, then (£) = (-1)™, with 
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Proof For1 <j < ae let 0 < rj; < p be the remainder upon division of ja by 
Pp, So that (cf. Proposition 6.7) 


ja= |= | ptrj. (12.6) 
P 


It is immediate that 


=] 1 
ja=-1,-2,..., (5 ) mod p) & PF* rj <p. 


Hence, in the notations of Gauss’ lemma, there are exactly m indices | < j < Bet 
ae < rj < p, and it suffices to show that the parity of the number of 


pol), 
such indices equals that of )> jal | : 


Write 51, 52, ..., Sm (after some reordering, if needed) to denote the remainders 


r; such that P + <rj < p,andt,..., t (also after some reordering, if needed) 


the remainders r; for which 1 < rj < pot Then m+n = Bo, and (as in the proof 


of Gauss’ lemma) for 1 <i <j < pot we have r; 4 rj, p — rj. Therefore, 


for which 


p—1 
{1.2.....4 5 | 1p = Sie S05 PS Ut --ootah 


a disjoint union, so that 


On the other hand, adding equalities (12.6) over 1 < j < a we obtain 


2 4 x Fi m 
@-0(% ) =e | 2] m+ 2.05; 
j=l 
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Finally, recalling that a and p are both odd and looking at the last equality above 
modulo 2, we get 


p-l 


Z ja 
oF LAI-8 
| Pp 


j= 


as wished. oO 


The coming result is known in mathematical literature as the Quadratic Reci- 
procity Law of Gauss. The proof we present is due to the German mathematician 
of the nineteenth century Ferdinand Eisenstein. 


Theorem 12.29 (Gauss) Jf p and q are distinct odd primes, then 


6) (5) enone 


Proof By the previous lemma, we have 
(2) (4) =(-D"(-)", 
q P 


p-l 


with 


[| men EL | 


jail P 


m= 


— 


It then suffices to show that 


p—l 


»|4|+>|24|-AAG&). an 


j=l j=l 


for which we use the following double counting argument: since the right hand side 
of the desired equality counts the number of points of integer coordinates within the 
closed rectangle 


R= {on reR lex adie ys]. 


it is enough to count the number of those points in some other way, thus obtaining 
the left hand side of (12.7). 

For what is left to do assume, without any loss of generality, that p > gq, and 
consider the straight line y = pr (cf. Fig. 12.1). For each j > 1, the nonnegative 
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Fig. 12.1 Counting the y 
number of points of RN Z? 1 


SIR 


integer a counts the number of positive integers less than or equal to a; on the 


other hand, for 1 < j < pt we have 4 ¢ Z, so that a counts the number 


of points of integer coordinates situated on the straight line x = j, below the line 
y= 5% and above the line y = 0. However, since 


jq|_|@=)a|_|qaq_4 <|4)=4"* 

pil” 2p 2 2p} 2 2 
all of the points we are counting actually belong the closed rectangle R. Hence, in 
the notations of Fig. 12.1, we have 


p-1l 
De . 
> || = # points of integer coordinates within region I of R. 


jail P 


Analogously, 


q-1 

ae a é 
|? | = # points of integer coordinates within region II of 7, 

q 

j=l 


and there is nothing left to do. Oo 


We finish this section by applying the quadratic reciprocity law to the establish 
the existence of certain prime numbers. 


Example 12.30 Prove that there are infinitely many primes of the form 3k + 1, with 
KEN. 


>For a more general result, see Theorem 20.19. 
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Proof By the sake of contradiction, suppose that there were only a finite number 
of primes of the form 3k + 1, say pi, p2,-.-, Pn, and let x = (2p)... pn)? + 3. 
If p is a prime divisor of x, then we clearly have p # 2,3, p1,..., Pn, so that 
p = —1 (mod 3). Moreover, since 


(2p1..- Pn)” = —3 (mod p), 


we conclude that —3 is a quadratic residue modulo p. We must then have (=) = 1. 


=3 -1 3 -1 = 
ges) Oe aa 
P P P P 
and, by the quadratic reciprocity law, 


(2)(2) __ pV") ep, 


Pp 


However, 


Thus, 


© el hac 


Finally, p = —1 (mod 3) implies, by item (a) of Proposition 12.23, that (4) = 


(=) = —1; in turn, such an equality contradicts the fact that —3 is a quadratic 


residue modulo p. Oo 


Problems: Sect. 12.3 


1. If p is an odd prime, prove that —1 is a quadratic residue modulo p if and only 
if p = 1 (mod 4). 

2. Prove that there are infinitely many integer values of k for which the equation 
x? = y? +k does not admit any integer solutions x, y. 

3. Let a, b and c be given integers, not all of which equal to zero, and n be a 
natural number. If there exist integers x and y, relatively prime with n and such 
that ax* + bxy +cy” =n, prove that b* — 4ac is a quadratic residue modulo n. 

4. (Putnam) Prove that there does not exist x, y € Z for which x* + 3xy — 2y* = 
122. 

5. Given a, b € Z, show that: 


(a) 2b* + 3 has a prime divisor p such that p = +3 (mod 8). 
(b) (2b? + 3) { (a* — 2). 
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10. 


11. 


12. 


13. 


The two coming problems are also concerned with the extended Fibonacci 
and Lucas sequences (Fy)nez and (Ln)nez, as defined in the paragraph 
immediately before Example 12.25. 


. Let (Ln)nez be the extended Lucas sequence. Prove that L, is twice a perfect 


square if and only if n = 0 or +6. 


. Let (Fn)nez be the extended Fibonacci sequence. Prove that F,, is a perfect 


square if and only ifn = 0, +1, 2 or 12. 


. If p #3 is an odd prime, show that 


3\ __f 1, if p=+1 (mod 12) 
~ | -1, if p = +5 (mod 12) ° 


. Let m and n be odd naturals, with n > 1. Prove that: 


(a) 2” — | has a prime divisor p such that p = +5 (mod 12). 
(b) (2"— 1) { 8B" — 1). 


If n > 1 is anatural number for which p = 2” + 1 is prime, do the following 
items: 


(a) Show that 3 is a quadratic nonresidue modulo p. 
(b) Conclude that 3 is a primitive root modulo p. 


With respect to quadratic residues modulo a prime power, do the following 
items: 


(a) Let a and k be given integers, with a being odd. Prove that the congruence 
x* = a (mod 2‘) has an integer solution for every k > 2 if and only if the 
congruence x” = a (mod 8) has an integer solution. 

(b) Let p be an odd prime and a € Z be relatively prime with p. Prove that the 
congruence x* = a (mod p*) has an integer solution for every k > 1 if and 
only if the congruence x* = a (mod p) has an integer solution. 


* Let a and n be relatively prime integers, with n > 1, andn = 2* pi bbe pi be 
the canonical factorization of n into prime powers. Prove that the congruence 
x? = a(modn) has an integer solution if and only if the following conditions 
are satisfied: 


(i) a = 1(mod2) if k = 1,a = 1 (mod 4) if k = 2 ora = 1 (mod 8) ifk > 3. 
pi —1 
(ii) a> =1(mod p;), for 1 <i <t. 


(APMO) A teacher gathered his 1 students around a circle. Then, he chose one 
of them, gave him/her a candy and, proceeding in the counterclockwise sense, 
distributed candies to the students according to the following rule: he jumped 
one student and gave a candy to the third one, then jumped two students and 
gave a candy to the sixth one, jumped three students and gave a candy to the 
tenth one, and so on, always jumping one student more than the previous time 


310 


14. 


15. 
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and giving a candy to the next one. Find all values of n for which every student 
will have received at least one candy after a finite number of turns of the teacher 
around the circle. 

Given a prime p and relatively prime integers a and n, with n > 1, we say that 
a is an-th residue modulo p if the congruence x” = a (mod p) has an integer 
solution. Generalize Euler’s criterion for quadratic residues in the following 
way: if d = gcd(n, p — 1), then 


-1 
a is an n-th residue modulo p & ad =| (mod p). 


For the coming problem, the reader will need to use some basic facts on the 
elementary symmetric sums of the roots of a polynomial, for which we refer to 
Sect. 16.2. 

(Bulgaria—adapted) The purpose of this problem is to show that, if p > 5 
is prime and k > 1 is integer, then p* divides (7 ) — k. To this end, do the 
following items: 


(a) Until item (f), assume that k is odd. Conclude that 


(‘”) bas" = Few) 
P ~ (p-1)! 


where f(X) = Wa (x + bp + i), is a polynomial with integer 


coefficients and a = eye 


If f(X) = XP"! + ap_2X?-* + ap_3X?3 +--+» + aX + ao, show that 
f(a) — f(—a@) = 2a, (mod Dp) and, then, deduce that it suffices to show 
that a; = 0 (mod p?). 

Use Girard’s relation (cf. Proposition 16.6) to show that condition a; = 


p—1 


0 (mod p?) is equivalent to the congruence et r; = O(mod p), where 


rj = TT eeest (a + i). 
tA. p—j 


(b 


wm 


(c 


wm 


p-l 


(d) Show that, modulo p, equality (a+ j)(a+p—j)rj = IL (a+) implies 


VN 


the congruence Pe = —(p— 1)! (mod p), for 1 <j < Bo 
Conclude from item (d) that r; is a quadratic residue modulo p and r; # 
Pp 


(e 


wm 


rj (mod p), for all 1 <i < j < 2. 


(f) Deduce that, modulo p, we have 


—~|\2 
Piet SP 2 el) |. 


Then, finish the proof for odd k. 
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(g) If k is even, adapt the steps delineated from items (a) to (f), arguing from 
the beginning with the polynomial f(X) = 1 (x + s - i). 


12.4 Sums of Two Squares 


As an application of the ideas developed so far, in this section we characterize the 
natural numbers which can be written as the sum of two perfect squares. We start 
by looking at the case of prime numbers, with the following result of P. de Fermat. 


Theorem 12.31 (Fermat) The following conditions on an odd prime p are 
equivalent: 


(a) p = 1 (mod4). 
(b) —1 is a quadratic residue modulo p. 
(c) pcan be written as the sum of two perfect squares. 


Proof 
(a) => (b): letting p = 4k + 1, it follows from item (c) of Proposition 12.23 that 


(=) =(-) = (-1)* = 1 (mod p) 
D 


and thus —1 is a quadratic residue modulo p. 
(b) = (c): let h € Z be such that h? + 1 = 0 (mod p), and 
A={(x,y); x,y EZ, O<x,y < Jp}. 


By the fundamental principle of counting (cf. Sect.1.1), we have |A| = 
(L/P) + 1)’. Now, since 


2 2 
(LVP] +1) > VP =P 
and there are only p possible distinct remainders upon division by p, the 


pigeonhole principle (cf. Sect.4.1) implies the existence of distinct ordered pairs 
(1, y1), (x2, y2) € A for which 


hx, + y,) = hx2 + y2 (mod p). 


Letting a = |x; — x2| and b = |y, — y2|, we have that both a and b are nonzero and, 
hence, 


2 2 
0 <a? +b? = |x) —x2/* +11 — yal? < /p +/p° = 2p. 
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However, since 


a* +b? = |x, — x2|? + ly — yol? 
= (xy — x2)" + (hx — hx2)? (mod p) 
= (h? + 1)(x1 — x2)? = 0 (mod p), 


the only possibility is that a* + b? = p. 


(c) => (a): let p = a? + b*, with a,b € Z. Since p is odd, we have a even and b 
odd, or conversely. Assume, without loss of generality, that a is even and b is odd. 
Then, Proposition 10.9 gives 
_ 2 2 = 
p=a+b =04+1=1(mod4). 
oO 


Before we can proceed, we need an elementary auxiliary result, usually attributed 
to Euler and known as Euler’s identity. 


Lemma 12.32 Let m and n be natural numbers, both of which can be written as a 
sum of two perfect squares. Then, mn can also be written as a sum of two perfect 
squares. 


Proof Writing m = a* + b* andn = c? +d”, we have 
mn = (a? + b*)(c? +d’) 
= ((ac)* + (bd)”) + (ad)? + (be)*) (12.8) 


= (ac + bd)? + (ad — bc)’. 
a] 


The coming result also credited to Fermat, provides a necessary and sufficient 
condition for a natural number to be written as a sum of two perfect squares. 


Theorem 12.33 (Fermat) A natural numbern > 1 can be written as a sum of two 
perfect squares if and only if the following condition is satisfied: for each prime p 
such that p = 3 (mod 4), the greatest power of p that divides n has even exponent. 


Proof 1n what follows, write 


_ 94 a1 a by b 
N= 2M Dy Pe TY 


with a,a;,b; € Z4 and p;, q; prime numbers such that p; = | (mod 4) and q; = 
3(mod4), forall 1 <i<kandIl <j <l. 


(i) Ifeach b; is even, then n can be written as a sum of two perfect squares: to prove 
this, first note that, by Theorem 12.31, each p; can be written as such; on the 
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other hand, we have 2¢ = (2°/*)?+0? if a is even 2% = (2-1/2)? 4(2@-D/?)2 
ue is odd; also, writing bj = 2c;, withc; ¢€ Zfor1 < j < 1, we have 
=(q, )? + 0°. Therefore, repeated applications of Lemma 12.32 allow us 
. eondiude that 7 can be written as a sum of two squares. 
(ii) If n can be written as a sum of two perfect squares, then each b; is even: it 
suffices to prove that if n can be written as a sum of two squares and b; => 1, 
then b; > 2 and 4, can also be written as a sum of two squares. To this end, if 


n= 24a? withe, deéZ, then c* +d? = 0(modq;). If d = 0 (modq;), then 
c = 0(modq;) and hence n = c 2 ae = 0 (mod); therefore, b; => 2 and 


n c \? a\* 
tone ere med Cosa 
qj qj qj 
By the sake of contradiction, assume that d # 0 (mod q;). Then gcd(d, g;) = 1, 


so that d is invertible modulo q;. Letting f denote its inverse modulo g;, we 
obtain 


(cf)? + 1 = 0(modq;). 


This contradicts Theorem 12.31, for q; = 3 (mod 4). oO 


We finish this section by showing that there is essentially only one way of writing 
a prime congruent to | modulo 4 as a sum of two perfect squares. 


Proposition 12.34 [f p is a prime number of the form 4k +1, then there exist unique 
x,y € N such that x < y and x? + y* = p. 


Proof We already know that there exists at least one pair of natural numbers x and 
y such that x* + y* = p. Assume that a, b is another such pair and note that a, b, x 
and y are all relatively prime with p and less than ,/p. We may then choose integers 
1 <c,z < p such that xz = y andac = b (mod p). 

We first claim that either c = z orc + z = p. Indeed, we have 


‘oe y* = x7 + (xz)* = x*(z7 + 1) (mod P); 


so that z7 = —1 (mod p). Analogously, c? = —1 (mod p), and hence p divides 
22 —c? = (z —c)(z +c), which in turn is the same as saying that p divides either 
z—c or z+c. However, since | < c,z < p,it follows that—p < z—c < z+c < 2p, 
and this (together with the fact that p | (z —c) or p | (+ c)) gives z —c = Oor 
Z+c=p. 

Now, assume that c = z. Then, the choices of c and z guarantee that 


bxz = acy = ayz(mod p), (12.9) 


314 12 Primitive Roots and Quadratic Residues 


and cancelling z we obtain bx = ay (mod p). However, since 0 < a,b, x, y < ,/p, 
we have 0 < bx, ay < p, and then bx = ay. Therefore, 


ay 2 y 2 y 2 
par+y=(=) +y=(2) @+e%)=(F) p, 


which gives y = b and x =a. 
Ifz+c = p, then, arguing as in (12.9), we arrive at bx = —ay (mod p) and, 
hence, at bx + ay = p. Thus, (12.8) furnishes 


p? = (a? + b?)(x* + y”) = (bx + ay)” + (by — ax)* = p* + (by — ax)’, 


from which by = ax. We now obviously get x = b and y =a. Oo 


Problems: Sect. 12.4 


1. The purpose of this problem is to show that there are infinitely many natural 
numbers that cannot be written as a sum of three squares. More precisely, prove 
the following result of Euler: there does not exist integers k, 1, x, y, z such that 
1 > Oand 


eat be SAR YT). 


2. Ifa, b,c € N are such that a(a — 1) = b* +c”, show that a + b is odd. 
x2+y? 
x-y 


is an 


3. (BMO) Find all pairs of distinct natural numbers x and y such that 
integer which divides 1995. 

4. Given a natural n, prove that there exist n consecutive naturals such that none of 
them can be written as the sum of two perfect squares. 

5. The purpose of this problem is to prove a theorem, due to Lagrange, which asserts 
that every natural number can be written as a sum of four perfect squares. To 
this end, we first observe that the following generalization of Euler’s identity 
holds: if two given naturals m and n can be written as sums of four squares, 
then mn can also be written this way. Indeed, if m = a? + b? + c? + d* and 
n=w?+x*+ y* +2’, then a simple (though surely tedious) verification shows 
that® 


mn = (aw — bx — cy — dz)’ + (ax + bw +.cz — dy)* 
(12.10) 
+ (ay —bz+cw + dx) + (az + by — cx +dw)’. 


®For a natural proof of such an identity, we refer the reader to item (f) of Problem 10, page 327. 


12.4 Sums of Two Squares 315 


Given such a result, do the following items’: 


(a) Let p be prime, S = {x*; x € Zp} and S’ = {—1—y; y € S}. Prove that 
SAS #G. 

(b) Conclude that, if p is prime, then there exist x, y,m € Z such that | < m < 
pandx*+y?+1=mp. 

(c) Let p be prime and 1 < m < p be such that mp = a7 + 5 + mA + Soe 


with x1, .x2,x3,x4 € Z. If —2 < y; < is chosen in such a way that 


2 2 
x; = y; (modm), prove that there exists 1 < r < m for whichrp = nea + 
yz + Y5 + YG. 
(d) Complete the proof of Lagrange’s theorem, showing that every natural 
number can written as the sum of four perfect squares. 


6. (IMO shortlist—adapted) The purpose of this problem is to find out all n € N for 
which there exists some m € N such that 2” — 1 divides m2 + 9. To this end, do 
the two coming items: 


(a) By contradiction, show that if 2” — 1 divides m? + 9 for some m € N, then 
n is a power of 2. 

(b) Ifn = 2*, make induction on k to show that there exists mz € N such that 
2" — 1 divides m? + 9. 


7We follow the steps delineated in problem XI.14 of the marvelous book [32]. 


Chapter 13 m®) 
Complex Numbers sei 


It is an obvious fact that the set of reals is too small to provide a complete description 
of the set of roots of polynomial functions; for instance, the function x b> xe + 1, 
with x € R, does not have any real root. Historically, the search for such roots 
strongly motivated the birth of complex numbers and the flowering of complex 
function theory. In this respect, a major first crowning was the proof, by Gauss, of 
the famous Fundamental Theorem of Algebra, which asserts that every polynomial 
function with complex coefficients has a complex root. 

In this chapter, we concentrate ourselves in the construction and discussion of 
the elementary properties of the set of complex numbers, postponing to Sect. 15.1 
the presentation of one simple proof of the aforementioned theorem of Gauss. 


13.1 Basic Definitions and Properties 


As was seen in Chapter | of [8], one generally looks at the set R of real numbers as 
the number line: one starts with a straight line (geometric entity), choosing a point 
in it to correspond to 0; then, one chooses a standard length to correspond to the unit 
of measure, and two distinct rules to operate with two points along the line (such 
operations being called addition and multiplication of real numbers), so that one 
gets a third point as the result of operating (i.e., adding or multiplying) two given 
points. We then call the points along the line as real numbers, assume that they fulfill 
the whole line and check that the operations of addition and multiplication satisfy 
the usual properties of commutativity, associativity etc. 
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The discussion at the former paragraph hints to the following natural question: 
would there be some way of defining two operations with the points of an Euclidean 
plane in a way similar to the ordinary addition and multiplication of real numbers? 
The answer to this question is yes and the resulting set, which we are about to 
describe, is that of complex numbers. 

Look at the cartesian plane as the set R x R of ordered pairs (a, b) of reals. Let 
® and © be the operations with the points of such a plane defined by 


(a,b) @(c, d) = (a+c,b+d) and (a, b)O(c,d) = (ac—bd,ad+bc), (13.1) 


where + and - respectively denote the ordinary addition and multiplication of real 
numbers. 

One can straightforwardly verify that © and © are associative and commu- 
tative operations, and that © is distributive with respect to , i.e., that for all 
a,b,c,d,e, f € R, the following properties hold: 


i. Associativity of © and ®: (a, b) ® ((c, d) P(e, f)) = ((a, b) B(c, d)) P(e, f) 
and (a, b) © (c,d) © (e, f)) = (a, b) © (c, d)) © Ce, f). 
ii. Commutativity of @ and @: (a,b) ® (c,d) = (c,d) ® (a, b) and (a, b) © 
(c,d) = (c, d) © (a, b). 
iii. Distributivity of ® with respect to @: (a, b) © ((c, d) @ (e, f)) = (a,b) © 
(c,d)) ® (a,b) Oe, f)) 


It is also pretty clear (check it!) that (0,0) and (1, 0) are identities for ® and ©, 
respectivamente, i.e., that 

(a, b) ® (0, 0) = (a, b) and (a,b) © (1, 0) = (a, db), 
for all a,b € R. Below, we prove a further important arithmetic property of ©, 
which is known as the cancellation law. 


Lemma 13.1 The following cancellation law holds for ©: 


(a, b) © (c, d) = (0, 0) > (a, b) = (0,0) or (c,d) = (0, 0). 


Proof Since (a, b)©(c, d) = (ac—bd, ad+bc), we getac—bd = Oandad+bce = 
0. Therefore, arguing as in the proof of Euler’s identity (cf. Lemma 12.32), we get 


0 = (ac — bd)* + (ad +. bc)? = (a? +b?) (c? +d’). 


Hence, either a” + b* = 0 or c? + d? = 0, which is the same as saying that either 
(a,b) = Oor (c,d) = 0. oO 


Now, let us look at the real line as the horizontal axis of the cartesian plane, 
which is equivalent to identifying each real number x with the ordered pair (x, 0). 
We have to check whether such an identification is good, in the sense that the results 
of performing © and © with such ordered pairs give (up to identification) the same 
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results we would get if we first performed the usual operations of addition and 
multiplication of real numbers. This reduces to verifying that 


(x, 0) ® (y,0) = («+ y, 0) and (x, 0) © (y, 0) = (xy, 0), (13.2) 


which is totally immediate. 

Thanks to the above computations, together with the fact that x t» (x, 0) is 
injective, we can safely consider R, furnished with its usual operations of addition 
and multiplication, as a subset of R x R, furnished with the operations @ and © 
defined as in (13.1). This also allows us to call the points of the cartesian plane as 
numbers, more precisely as the complex numbers. From now on, we shall denote 
the set of complex numbers by C. 

In what follows, we shall derive an easier way to represent the set of complex 
numbers and its operations. To this end, let us henceforth denote by 7 the complex 
number (0, 1), calling it the imaginary unit.! 

Letting ~ denote the identification of the points along the horizontal axis with 
real numbers, and taking into account the definition of ©, we are led to conclude 
that 


i? =(0,1)0 (0,1) = (0-0-1-1,0-141-0) = (-1,0) © -1. (13.3) 

Note also that 
(a, b) = (a, 0) © (0, b) = (a, 0) @ (6, 0) © (0, 1)) a + bi, (13.4) 
which is said to be the algebraic form. 

From now on, we shall write simply + and - to respectively denote the operations 
® and ©, and shall write a + bi to denote the complex number (a, b). We shall 
also omit the identifications involved, for, as we shall see in a few moments, there 
is no danger that such a usage induces wrong arithmetic inside C. It thus follows 
from (13.4) that 

a+bi=0<(a,b)=(0,0) Sa=b=0. 


According to (13.3), in the set C of complex numbers the equation 


x°+1=0 


'The names complex and imaginary are rooted in the historical development of complex numbers. 
More precisely, when mathematicians started using complex numbers, even without having a 
precise definition of what they should be, they called them complex or imaginary, in allusion to the 
oddness of guessing the existence of “numbers” whose squares could be negative. 
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has i as a root. As we shall see along this chapter and Chap. 15, this is the core fact 
related to the importance of complex numbers. 

As we anticipated above, a good reason for writing (a, b) € C in the algebraic 
form a + bi is that, upon doing it, we can operate with complex numbers exactly as 
we do with reals; one just has to take into account that i2 = —1. Indeed, we check 
below that the computations performed this way lead to the same results as those 
done by directly using the definitions of + and - (i.e., @ and ©) in C: 


¢ Computation of (a, b) + (c,d): by definition, we have (a, b) + (c,d) = (a+ 
c, b+ d). On the other hand, operating as we would do with reals, we obtain 


(a+ bi)+ (c+ di) =(a+c)+(6+d)i. 


However, since we are writing (a+c,b+d) = (a+c)+ (b+ d)i, both results 
do coincide. 

¢ Computation of (a,b) - (c,d): by definition, we have (a,b) - (c,d) = (ac — 
bd, ad + bc). Operating once more as with real numbers, we get 


(a+ bi)(c + di) =ac+adi+bci + bdi? = (ac — bd) + (ad + bc)i, 
since i2 = —1. Here again we are writing (ac — bd,ad + bc) = (ac — bd) + 
(ad + bc)i, so that both results also coincide. 


Thanks to the above identifications, we can surely write R C C, withx € R 
written in algebraic form as x + Oi. Moreover, by analogy with the addition and 
multiplication of reals, we shall also call the + and - operations on complex numbers 
as the addition and multiplication, respectively. 

Proceeding with the development of the theory, let us introduce in C other two 
operations, this time extending the subtraction and division of real numbers. Before 
we go on, we remark that it is common usage to write z, w etc to refer to general 
complex numbers, so that each one of them has its own algebraic form. 

Given z, w € C, subtracting w of z means obtaining a complex number z — w 
(the difference between z and w) such that z = (z—w)+ w. Writing z = a+ bi and 
w = c-+di and operating as with reals, we easily see that the only possible way of 
defining z — w is by letting 


z-w=(a+bi)—(c+di)=(a-—c)+(b-d)i. 
Formally, for z = a+ bi and w = c + di, the complex number z — w is defined by 
z—-w=(a—c)+(b-d)i, (13.5) 
being called the difference between z and w, in this order. 


Also for z,w € C, with w ¥ 0, dividing z by w means obtaining a complex 
number z/w (the quotient between z and w) such that z = (z/w) - w. Letting 
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z=a+bi and w = c+ di, and operating as we do upon rationalising real number, 
we immediately get 


z _atbi | (a+bi)(c—di) 


w ctdi  (c+di)(c—di) 
7 (ac + bd) + (be — ad)i 
~ c+ d? 
act+bd  bc—-ad. 
= + 1. 
c?2 + d2 c2 + d? 


as the only possibility for z/w. 
Formally, for z = a+ bi and w = c+ di, with w ¥ 0, the complex number z/w 
is defined by 


Zz ac+bd  bc—-ad. 


wc +d? 7 age 


(13.6) 


being called the quotient between z and w, in this order. 

The particular case of dividing | by a nonzero complex number leads us to 
conclude that every complex number z # O possess an inverse with respect to 
multiplication. Writing z = a + ib ¥ 0, it follows from (13.6) that such an inverse, 
which we shall denote by z~! or 1/z, is given by 


=] a b ‘ 
— l. 
epee gea he 


Once again, note that we need not worry about memorizing the above formulas. It 
just suffices to operate as we do with real numbers. 

In order to simplify lots of forthcoming computations, we now introduce the 
following notation: for z = a + bi € C, we write Zz to denote the complex number 
Z =a — bi, and call it the conjugate of z. In particular, we have Z=2Z. 

Note also that, upon multiplying z by Z, we get as result the real number a? + b?, 
for 


z-Z=(at+bi)(a — bi) = a? — (bi)® =a? +b’. 
We write |z|) = J/a* +? and call |z| the modulus of z. When z € R, it is 
immediate to note that the notion of modulus of a complex number, as defined 


above, coincides with the usual notion of the modulus of a real number (cf. [8], 
for instance). In short, 


g=atbis le? =z=a? +b’. (13.7) 
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Fig. 13.1 Conjugation of 
complex numbers in the 
complex plane 


Yet another simple fact concerning the modulus of a complex number is that 
|z| = |z|. This will become geometrically apparent quite soon (cf. Fig. 13.1), 
provided we go back to the very definition of C, looking at complex numbers as the 
points of the cartesian plane. Upon doing this, we obtain a geometric representation 
of C, which is known as the complex plane.” 

In the complex plane, the horizontal and vertical axes are respectively called 
the real and imaginary. The real axis is this formed by the complex numbers 
which are actually real numbers (i.e., by the ordered pairs (x, 0) © x), whereas the 
complex numbers along the imaginary axis are those of the form yi, with y € R 
(this corresponds to the ordered pairs (0, y) = (y, 0) - (0, 1) © yi); such complex 
numbers are called pure imaginary (so that i is one of them). 

Also with respect to the complex plane, since z = a+ bi = (a, b), it follows that 
z and z = a — bi are symmetric with respect to the real axis (cf. Fig. 13.1); hence, 
|z| = |z|. On the other hand, the real numbers a and b are respectively called the 
real and imaginary parts of z, and are denoted 


a= Re(z), b=Im(z). 


Thus, z = Re(z) + i Im(z), so that z = Re(z) — i Im(z) and, hence, 


ZZ C= Zz 


Re(z) = and Im(z) = ra (13.8) 


Also called the Argand-Gauss plane, in honor of the amateur Swiss mathematician of the 
eighteenth century Jean-Robert Argand, and of the great J. C. F. Gauss. 
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The following lemma collects further useful properties of the conjugation of 
complex numbers. In order to properly state it, we observe that the associativity 
of the multiplication of complex numbers guarantees the well definiteness of the 
n—th power z", for z € Candn € N, as z! = zand 


LS eee MEG 


n times 


for n > 1. Moreover, such a definition can be easily extended to integer exponents 
n by setting, for z € C \ {0}, z° = 1 and, for an integer n < 0, 


zn = (ae ie = 


—n-* 


Then, easy inductive arguments allow us to show that the usual arithmetic rules 
for powers remain valid, namely: 


(z”)" = in and (zw)” = zy” (13.9) 


for all z, w € C \ {0} and m,n € Z. 
We also take the opportunity to point out (although this would not be needed in 
the coming lemma) that the usual binomial formula, 


«+w)"=)> (Jet 


k=0 


(for all z, w € C \ {0} such that z+ w # 0, and every n € N), also holds true in 
C, since its proof only depends on properties of addition and multiplication of real 
numbers that also hold true in C. 

We can finally state and prove the aforementioned result. 


Lemma 13.2 If z and w are nonzero complex numbers, then: 


(a) zERS Re(zZ) =0 86 z=. 

(b) zt wW=Z+WU, Zw =Z- Wand Z/w =Z/W. 
(c) 2? = (z)", for everyn € Z. 

(d) |zZh =187Z=1/z. 


Proof 


(a) Letting z = a+ bi, we have 


zEeRSdD=0Ssatbiz=a-bi Sz=Z. 


324 13. Complex Numbers 


(b) Writing z=a-+bi and w = c+ di, we compute 


Z+w =(a—bi) + (c—di) 
=(a+c)—(b+d)i 


=z+w 


and 


zw = (ac —bd)+(ad+be)i 
= (ac — bd) — (ad + be)i 
= (a —bi)(c— di) = Z-W. 


From this last one, it follows that 


z/w-w=Zz/w-w=Zz 
and, hence, 
Z/w=Z/w. 


(c) Forn = 0, the result is immediate and, for n € N, it easily follows by induction 
from (b). For an integer n < 0, first note that, by item (b), we have 


l=l=zz!=7z-71, 


so that z-! = (z)~!; hence, letting u = z7!, the first part above, together 
with (13.9), furnishes 


Zou" =@)"=(@ 1) 7 = —™ = ow". 
(d) Since (cf. (13.7)) z-Z7= Iz|, we conclude that 


ZJH=loz-z7=1S67=1/z. 
oO 


Example 13.3 (Spain) If z and w are complex numbers of modulus | and such that 


Z+w 
zw # —1, show that >; 1s real. 


Proof Letting a = it oT , item (a) of the previous lemma guarantees that it is enough 


to show that a = a. To this end, note that, by applying items (b) and (d) of that same 
result, we get 
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Oo 


Our next result gives quite a useful geometric interpretation of the modulus of 
the difference of two complex numbers. 


Proposition 13.4 Given z, w € C, the real number |z — w| equals the Euclidean 
distance from z to w in the cartesian plane subjacent to the corresponding complex 
plane. 


Proof Ifz=a+bi and w =c+di, then 


Iz—w| =|(a—c) + (b—d)i| = Va—o)? + (b—a)?. 


On the other hand (cf. Fig. 13.2), basic Analytic Geometry (cf. Chapter 6 of [9]) 
assures that the distance from z to w is also given by V(a —c)*+(b-d). oO 


The coming inequality (13.10) is known as the triangle inequality for complex 
numbers. 


Corollary 13.5 fu, v and z are any complex numbers, then 
ju—v| < |ju—zl\+ |z—vj. (13.10) 


Proof According to Proposition 13.4, the corollary only says that the length of any 
side of a (possibly degenerate) triangle does not exceed the sum of the lengths of 
its other two sides. However, we already know this to be true (cf. Theorem 2.26 of 
[9]). Oo 


Fig. 13.2 Modulus of the 
difference between two 
complex numbers 
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1. With respect to the operations of addition and multiplication of complex 
numbers, check that they are associative and commutative, as well as that 
multiplication is distributive with respect to addition. Check also that (0, 0) 
and (1, 0) are unit elements for the addition and multiplication, respectively. 

2. * Given z, w € C, prove that: 


(a) |zw| = |z| - |w| and (if w 4 0) |z/w| = |zI/lwl. 
(b) |z + wl? = |z|? + 2Re(Zw) + |w|?. 


3. * Use item (b) of the previous problem to establish, for all z, w € C \ {0}, the 
validity of the inequality |z+w| < |z|-+|w|, with equality if and only if z = Aw, 
for some 4 > 0. This inequality is also known as the triangle inequality for 
complex numbers. Use it to: 


(a) Deduce the validity of (13.10). 
(b) Prove that ||z| — |w|| < |z — w|, forall z, w € C. 


4. Fora given z € C, show that |z| = 1 if and only if there exists x € R for which 
1=ix 


a= Ie 
5. (OCM) Let a and z be complex numbers such that |a| < 1 and az # 1. If 


Z—a 


<1, 


l-az 


prove that |z| < 1. 
6. Leta € Cand R > 0 be such that |a| # R. Let I be the circle of center a and 


radius R in the complex plane (so that 0 ¢ T°), and PF = {2; ZE ae Show that 


l’ is the circle of center eae and radius ia ee 
For the coming problem, we define a sequence of complex numbers as 
a function f : N — C-. As in our discussion of sequences of real numbers, 
performed in [8], given a sequence f : N > C we write z, = f(n) and refer 

to the sequence f simply by (Zn)n>1. 
7. (Netherlands) The sequence (z;)x>1 of complex numbers is defined, for k € N, 


by 


k : 
i 
I] ( vi 
with i representing the imaginary unit. Find all n € N for which 


n 
Yo leet — Ze] = 1000. 
k=1 
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8. Given z, w € C, let uw and v be the vectors starting at 0 and having z and w 
as endpoints, respectively. Prove that the complex numbers z + w and z — w 
are the endpoints of the vectors w+ v and u — v, respectively, both of which 
starting at 0. 

For the coming problem, recall (cf. Sect. 4.3) that a (partial) order relation 
in a nonempty set X is a relation < in X which is reflexive, transitive and 
antisymmetric; moreover, if we have x < y or y x x for all x, y € X, then the 
order relation < is said to be a total ordering of X. 

9. Prove that the set of complex numbers cannot be totally ordered. More 
precisely, prove that there does not exist, on C, a total ordering <, which 
extends the usual order relation in R and satisfies the conditions 


Oxzw>O0%K z+ u, Zw. 


The coming problem generalizes the construction of C, presenting the set H 
of Hamilton’s’ quaternions (also called quaternionic numbers). 
10. * InH = {(a, b,c, d); a,b, c,d, € R}, let @ and © be the operations defined, 
for a = (a,b, c,d) and B = (w, x, y, Z), by 


a@®p=(a+w,b+x,c+y,d+2) 
and 
a© Bp = (aw — bx —cy —dz,ax+bw+cz—dy, 
ay —bz+cw+dx,az+by—cx+du), 


where + and - respectively denote the usual operations of addition and 
multiplication of real numbers. Such operations are called the addition and 
multiplication of H. With respect to them, do the following items: 


(a) Show that the function: : R — H, given by (x) = (x, 0, 0, 0) preserves 
operations, in the sense that 


U(x) u(y) = u(x + y) and u(x) Ol(y) = L(+ y), 


for allx,y ER. 

(b) Show that the functions 11, 12,43 : C — H, given by (x + yi) = 
(x, y, 0, 0), t2(x+ yi) = (x, 0, y, 0) and 13(x+ yi) = (x, 0, 0, y), preserve 
operations, in the sense that 


Uj(Z) Buj(w) =1jZ+w) and 1j(z)Olj(w) =1jZ+w), 


for all z, w € Cand1 <j <3. 


3 After William R. Hamilton, Irish astronomer, mathematician and physicist of the nineteenth cen- 
tury. Quaternions have many important applications in Mathematics and Physics, but unfortunately 
they lie far beyond the scope of these notes. 
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11. 


(c) 


(d 


wa 


(e) 
(f) 


(g) 
(h) 


13. Complex Numbers 
In view of items (a) and (b), we hereafter write simply + and - to denote 


® and ©, respectively, x to denote (x,0,0,0) andi = (0,1,0,0), 7 = 
(0, 0, 1,0) and k = (0, 0, 0, 1). Under these conventions, show that 


(13.11) 


ij=—jizk, jk = —-kj =i,ki = -—ik=j 
and 
(a,b,c,d) =a+bi+cj +dk. 


Show that the results of (a + bi + cj + dk) + (w+ xi + yj + zk) and 
(a+ bi+cj + dk)-(w+xi + yj + zk) are the same we would have 
obtained by using the usual operations of addition and multiplication of 
real numbers, together with (13.11). From this, conclude that: 


i. The addition of quaternions is commutative, associative and has 0 as 
identity. 

ii. The multiplication of quaternions is associative, distributive with 
respect to + and has | as identity; nevertheless, it is not commutative. 


Fora =a+bi+cj+dk € H, leta = a — bi — cj — dj be the conjugate 
of a. For a, B € H, show that aB = af. 

For a = a+bi+cj+dk € H, let |a| = /a? + b? + c2 + d? be the norm 
of a. Show that w@ = |a|* and |aB| = |a||A|, for all w, 6 € H. 

Use the result of the previous item to deduce identity (12.10). 

Conclude that, given a € H \ {0}, there exists a single 6 € H for which 
af = 1. Then, show that the following cancellation law holds in H: if 
a, 6 € Hare such that a6 = 0, thena = Oor B = 0. 


(Japan) Let P be a convex pentagon whose sides and diagonals have, in some 
order, lengths equal to /1, Jo, ..., lio. If all of the numbers us ce ee ie are 


rational, prove that / io is also rational. 


13.2. The Polar Form of a Complex Number 


Given z = a+ bi € C \ {0}, let a € [0, 277) be the least value, in radians, of the 
trigonometric angle measured from the positive real axis to the half line joining 0 to 
z (cf. Fig. 13.3). 


Writing 


el ( ree ) 
Z= (|Z Uy], 
Vaz+b2 Ja? +b? 
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Fig. 13.3 The polar form of 
a complex number 


we easily get 


b 
a + be [q2 + 92’ 


a . 
cosa = ———— and sina = 


so that 
z= |z|(cosa +i sina). (13.12) 


Since sin(a + 2k) = sina and cos(a + 2km) = cosa for every k € Z, 
equality (13.12) continues to hold with aw + 2kz in place of a. For this reason, 
from now on we say that the real numbers of the form a + 2kz, with k € Z, 
are the arguments of the nonzero complex numbers z, and that @ is its principal 
argument. 

Letting a be any argument of z € C \ {0}, we shall call (13.12) the polar, or 
trigonometric, form of z. Note that 


|cosa +isina| = 1, (13.13) 


for every a € R. Hereafter, we shall denote the complex number cosa@ + i sina 
simply by cisa. Thus, letting @ be an argument of z € C \ {0}, it follows 
from (13.12) that 


Z= |z| cisa. 
The coming example presents an interesting, albeit elementary, use of the notion 


of argument of a nonzero complex number. 


Example 13.6 Among all complex numbers z for which |z — 25i| < 15, find the 
one having smallest principal argument. 


Solution The complex numbers satisfying the given condition are those situated on 
the closed disk bounded by the circle of center 25i and radius 15. Among these, the 
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one with smallest principal argument, say z, is easily seen to be such that the half 
line starting at 0 and passing through z is tangent to the aforementioned circle in the 
first quadrant of the cartesian plane (see the figure below). 


Im 


a 


Since the radius of the circle is 15, Pythagoras’ Theorem applied to the right 
triangle with vertices 25i, z and 0 gives |z| = 20. Now, letting z = a + bi, we have 
that a equals the height of such a right triangle relative to its hypotenuse. Therefore, 
the usual metric relations on right triangles (cf. Chapter 4 of [9], for instance) give 
25a = 15 - 20, so that a = 12. From |z| = 20 we get 


207 = [z|* =a? +b? = 127+ d? 


and, since z belongs to the first quadrant, b = 16. Hence, z = 12 + 16i. oO 


Back to the development of the theory, we now present the computational 
advantages of the polar form of nonzero complex numbers. We start with a simple, 
albeit very important result. 


Proposition 13.7 Given a, 8B € R, we have 
cisa- cisB = cis(a+ B). (13.14) 
Proof It is enough to compute 


cisa- cisB = (cosa +isina)(cos 6B +i sin B) 


= (cosa cos § — sina sin B) +i(cosa@ sin B + cos f sing) 
cos(a + 6B) +i sin(a + 8) 
cis (a + B), 


where in the third equality above we applied the usual addition formulas of 
Trigonometry (cf. Chapter 7 of [9], for instance). oO 
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Fig. 13.4 Geometric 
interpretation of the 
multiplication by cis a 


The coming corollary provides the usual geometric interpretation for the multi- 
plication of complex numbers, one of which of modulus 1. For the general case, we 
refer the reader to Problem 1. 


Corollary 13.8 Let a be a given real number and z € C \ {0}. If u is the vector 
that goes from 0 to z, then the point of the complex plane representing (cis a) - z is 
the endpoint of the vector obtained from u by means of a trigonometric rotation* of 
center 0 and angle a (cf: Fig. 13.4). 


Proof Letting z = |z| cis @, we have from (13.14) that 
Z:- cisa = |z|cisO@- cisa = |z|cis(0+ a). 


Now, the description of the polar form assures that this last complex number is 
precisely the endpoint of the vector obtained from wu by means of a trigonometric 
rotation of @ radians. Oo 


Formula (13.15) below, known as the first de Moivre’s formula, gives further 
insight on the computational advantages of the polar representation of nonzero 
complex numbers. 


Proposition 13.9 (de Moivre) Jf z = |z| cisa@ is a nonzero complex number and 
n € Z, then 


z” = |z|” cis (ner). (13.15) 


Proof The case n = 0 is trivial. For the time being, assuming we have proved the 
formula for n > 0, let us show how to set its validity also for n < 0. To this end, 


Recall that this means we rotate u through an angle of o radians, in the counterclockwise sense if 
a > 0 and in the clockwise sense if a < 0. 
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let n = —m, with m € N. Given 6 € R, it follows from item (d) of Lemma 13.2, 
together with |cis | = 1, that 


(cis6)~! = cis? = cos@ +isin@ = cos@ —isin@ 
(13.16) 
= cos(—0) + i sin(—@) = cis (—6@). 


Therefore, since we are assuming the validity of (13.15) with m in place of n, we 
get 


n 


Zz i zm 


= (|z|cisa)~” = |z|~" (cis (ma))~! 
= |z|”" cis (—ma) = |z|" cis (na). 


For n > O, let us make induction on n, the case n = 1 being trivial. Assuming 
that the result holds true for some n € N and invoking (13.14), we obtain 


tl 7.2" = |z\cisa - |z|" cis (na) 


ae 


= (Zz cis@ - cis (na) 


= |z|"*! cis(n + la. 
oO 


Let us now compute the arguments of the product and the quotient of two nonzero 
complex numbers. 


Corollary 13.10 [fz = |z| cisa and w = |w|cis B are nonzero complex numbers, 
then 

m4 lel cg 
zw = |zwi|cis(a+ B) and — = —. cis(a— B). 
w 


|w| 


In particular, a + B (resp. a — B) is the measure, in radians of an argument for zw 
(resp. for z/w). 


Proof Let us do the proof for =, the other case being entirely analogous. To this 
end, it suffices to successively apply (13.16) and (13.14) to get 


z i a ee rd : 
~ = [z|cisa - |w|~! cis (—B) = ae cis (a — B). 
w |w| 


oO 


Given n € N and z € C \ {0}, we say that a complex number w is an n-th root 
of z provided w” = z. In contrast with what happens with real numbers, we shall 
see next that the number of -th roots of each nonzero complex number is exactly n. 
Nevertheless, whenever there is no danger of confusion, we shall write simply 2/z 
to denote any such root. In any case, formula (13.17) below, known as the second 
de Moivre’s formula, teaches us how to compute all of them. 
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Proposition 13.11 (de Moivre) Jf z = |z| cis@ is a nonzero complex number and 
n € N, then there are exactly n distinct complex values for the n-th root of z. 
Moreover, such values are given by 


2k 
We - cis (~~). 0<k<n,keN, (13.17) 


where ./|z| stands for the usual positive real root of |z\. 


Proof Writing w = r cis @, we have 


w" =z }(rcisé)” = [z|cisa 
& r" cis (nO) = |z| cisa 


Sr" =|z| and n€ =a4 2kn, Jk eZ. 


ne ‘ . _ on — a+2kn 
In turn, these last two equalities take place if and only ifr = ¥/|z| and@ = =, 


for some k € Z. Hence, there will be as many distinct n-th roots of z as there are 


distinct complex numbers of the form cis (cee 


). Now, it is easy to see that 
_ (a + 2k ~fa+2(k +n)0 
cis(———*) = cis(————""*) 


n n 


and 


2k. 21 
cis(@ 7" *) A cis(———* * *) 
n n 


for 0 <k <I <n, so that is suffices to consider the integers k such thatO < k <n. 
i] 


In spite of the above formula, it is worth noticing (and will be clear in a few 
moments) that its major role is theoretical; in other words, it is not always the best 
way of actually computing the roots of a given complex number. This is due to the 
fact that not always the polar form of a nonzero complex number is actually useful 
in computations, as shown by the coming 


Example 13.12 Compute the square roots of 7 + 247. 


Solution If we try to use the second de Moivre’s formula, we ought to start by 
writing 7+ 247 = 25 cisa, with a = arctan #. Then, we invoke (13.17) to compute 


7+ 24i as being equal to either 


Scis (5) or 5cis (- +), 
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We now get to the point: since a = arctan 4 we shall have to compute cos ($) 
and sin (3) by knowing tana = u Although this can be done with the adequate 
trigonometric formulas, it is much simpler to take an alternative path, which we 
shall now describe. 

Write 7 + 241 = (a + bi)’, with a,b € R, so thata+ bi isa square root of 
7+ 24i. Expanding (a + bi)? and equating real and imaginary parts in both sides of 
the equality thus obtained, we get the system of equations 


a~—b*=7 
ab = 12 


Squaring the second equation and substituting a* = b* +7 in the result, we arrive at 
the biquadratic equation (b* +7)b? = 144, so that b* = 9. Now, since ab = 12 > 0, 
we conclude that a and b must have equal signs, so that the possible ordered pairs 
(a, b) are (a, b) = (3,4) or (—3, —4). Therefore, the square roots we are looking 
for are +(3 + 47). oO 


We now present an interesting (and useful) geometric interpretation of the n-th 
roots of a nonzero complex number. 


Proposition 13.13 Jf z is a nonzero complex number and n > 2 is natural, then, in 
the complex plane, the n-th roots of z are the vertices of a regular n-gon with center 
at the origin. 


Proof Letting a be an argument of z, it follows from the second formula of de 
Moivre that the n-th roots of z are the complex numbers Zo, Z1, ..., Zn»—1 such that 


(a+ 2kr 
sa al cio( +2), 


forO<k <n. 
From (13.13), we obtain 


n : a + 2k n 
ical = Viel] cis (———*)| = cl, 


so that all of the points zz, lie on the circle of the complex plane centered at 0 and 
having radius ¥/|z|. 
On the other hand, it follows from Corollary 13.10 that 


Zk+1 _ (20 
ATL = cis (=) : 
Zk n 


for 0 < k <n. Therefore, letting ux; denote the vector that goes from 0 to zx, 
Corollary 13.8 assures that the angle between wx and ux+1, in radians and in the 
counterclockwise sense, is equal to an 

The result follows immediately from these two facts. Oo 
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Fig. 13.5 Geometric Im 
configuration of the fourth 
roots of —1 


Re 
%3 24 


Example 13.14 Figure 13.5 represents the fourth roots of —1. Observe that z; = 


—Tecis 2 = Lt 
V|— Ilcis = Woe 


As an important particular case of the discussion of roots of complex numbers, 
we say that the complex number is a root of unity if there exists a natural number 
n such that w” = 1. In this case, w is said to be an n-th root of unity. 

Since 1 = cis0, the second formula of de Moivre teaches us that the n-th roots 
of unity are the n complex numbers given by 


. (2k0 
i= cis(—*): O<k<n,keZ. (13.18) 
n 


Writing w = cis aH it follows from the first formula of de Moivre that 


2k. 27 \k 
cis(—*) a cis(=) =o. 


n n 


Therefore, the n-th roots of unity are the complex numbers 
1,@,...,@" |, (13.19) 


As we shall see along the rest of this book, roots of unity are quite useful in a 
number of situations, both algebraic and combinatorial. For the time being, a clue of 
why this is so is provided by the following consequence of the previous proposition. 
In order to properly state it, from now on we shall refer to the circle of the complex 
plane with center 0 and radius | simply as the unit circle. 


Corollary 13.15 [fn > 2 is a natural number, then the n-th roots of unity are the 
vertices of the regular n-gon inscribed in the unit circle and having | as one of its 
vertices. 
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Fig. 13.6 Sixth roots of unity 


Proof Apply Proposition 13.13 to z = 1, noticing that 1 is also one of the roots we 
are looking at. oO 


Example 13.16 In the notations of Fig. 13.6 we have w = cis oa — ae, so that 


the complex numbers 1, , ..., w> are the sixth roots of unity. Note also that 1, aw 
and ow“ are the cubic roots of unity, whereas w, w°> = —1 and @ are the cubic roots 


of -1. 


We finish this section by presenting two examples involving roots of unity and 
which are somewhat more algebraic in nature. 


Example 13.17 Givenn €N, find all of the solutions of equation (z — 1)” = z”. 


Solution Since z = 0 is not a solution, the given equation is equivalent to 
n 
(1 - i) = |. Hence, letting w = cis an the above discussion assures that | — L 


is an n-th root of unity, thus being equal to one of the numbers 1, w, w7, ..., @”—!. 
Now, observe that 1 — i = | has no solutions. On the other hand, for 1 < k < 
n — 1 we have w* ¥ 1, so that 1 — ! = w* if and only if z = Fe Thus, the 
solutions of the given equation are the n — 1 complex numbers 


1 1 1 


l-w 1-@’ ? 1-0! 


oO 


For the next example, recall (cf. discussion preceding Problem 7, page 326) that 
a sequence of complex numbers is a function f : N > C, which, most often, will 
be simply denoted by (Zn)n>1, where z, = f(n). This being said, we observe that 
the proof of the formula for the sum of the first n terms of an arithmetic progression 
with common ratio different from 0 and 1 (cf. item (b) of Proposition 3.12 of [8], for 
instance) still holds, ipsis literis, for a geometric progression of complex numbers, 
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ie., a sequence (Z,)n>1 in C such that z,41; = qzz for every k > 1, withg € 
C \ {0, 1}. Hence, we can safely skip the proof of the following auxiliary result. 


Lemma 13.18 For z € C \ {0, 1}, we have 


Ze) 
+ zgtote $e) = 


As a direct corollary of such a result, we note that if @ ~ 1| is an n-th root of 


unity, then w” = | and, hence, 


lt+oto*+---+o0"! = 


We shall use this formula several times along this text. 


Example 13.19 (IMO) Prove that 


TU 20 30 
cos cos + cos = 
7 7 7 


Proof Letting w = cis ae be a seventh root of unity, we get 


2 oes 20 An 67 
Re(@ + w* + @”) = cos + cos + cos 


(13.20) 


7 7 7 
370 W 
= COS cos cos —. 
7 7 7 
On the other hand, since w* - w’~* = 1 and |w*| = 1, item (d) of Lemma 13.2 
furnishes 
“aio = wo. 


Therefore, o+o+o3 =o0+a+04 and, hence, 
Re(@ + e+ w°) = Re(w® tego Ae ae): 
Now, (13.20) gives 
oto +---+@°=-1. 


Taking real parts and using (13.21), we obtain 


(13.21) 


2Re(w + w + w*) = Re(w + w + w*) + Re(w® + @ + w*) = —1. 
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Thus, it follows from (13.21) that 


4 a 30 Retoter oyese 
cos = cos — cos = = e(a+a @ = 5 


Problems: Sect. 13.2 


1. Forr € R\{0}, we define the homothety of center 0 and ratio r as the function 
H, :C — C such that H,(z) = rz for every z € C. For 6 € R \ {0}, we define 
the rotation of center 0 and angle 6 as the function Rg : C \ {0} > C \ {0} 
given by Rg(z) = (cis 9)z, for every z € C \ {0}. 


(a) Let u be the vector starting at 0 and having z as its endpoint. If w = H,(z), 
prove that ru is the vector starting at 0 and having endpoint w. 

(b) If w = rcis@ is a nonzero complex number, prove that for every z € C\ {0} 
we have 


wz = H, o Re(z). 


2. (OCM) Let w be a complex number satisfying w? + w + 1 = 0. Compute the 
value of the product 


27 


fi“) 


k=1 


io) 


. Letn € N bea multiple of 3. Compute the value of (1 + Ja = 1/30". 
4. Solve, in C, the system of equations 


lz = |z2| = |z3) = 1 
Z+2+723=0 
212223 = | 


5. Letn € N anda é€ C \ {0} be given, with |a| > 1. Show that all of the roots 
of the equation (z — 1)” = a(z + 1)” lie on the circle of the complex plane 
2 

oR? 

6. (Ireland) For eachn € N, let a, =n? +n+ 1. Given k € N, prove that there 
exists m € N such that ay_jax = Am. 

7. (Romania) Let p,q € C, with g 4 0, be such that the second degree equation 
x? + px +q? =0 has complex roots of equal modulus. Prove that e ER. 


centered at 


and having radius a 7, Where R = “Jal. 
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8. Let z1, z2 and z3 be three given complex numbers, not all zero. Prove that z1, 
z2 and z3 are the vertices of an equilateral triangle in the complex plane if and 
only if 


2 2 2 
Zy + 29 + 23 = 2122 + 2223 + 2321. 


9. Given an integer n > 2, use complex numbers to prove that 


n-1 n-1 


2 a =e 


10. Given an integer n > 2 anda € R, use complex numbers to compute each of 
the sums below in terms of n and a: 


(a) sina + sin (2@) + sin (3@) +---+ sin (na). 
(b) sin* w + sin?(2a) + sin*(3a) +--+ + sin?(na). 


11. (MO—shortlist) Let g : C > C be a given function and w = cis at For each 
given complex number a, show that there exists only one function f :C > C 
such that 


f(@ + f(@z +a) = g(z), Vz EC. 


12. We are given a regular polygon A; Az... Az, and a point P in the plane. Show 
that 


n n 

—{[—2 2 

) A2x,P = ) Az-1P. 
k=1 k=1 
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We begin this section by illustrating the role of roots of unity as a counting tool. 
Although several applications of algebraic arguments in Combinatorics will appear 
along the rest of these notes, this is a much broader subject than we will be able to 
present here. For a more comprehensive introduction we refer the interested reader 
to the excellent classical book of Van Lint and Wilson [40]. 

We start by examining, in the coming example, how to use complex numbers as 
tags. 


Example 13.20 We putn > | equally spaced lamps around a circle. At first, exactly 
one of then is ON. An allowed operation on the states of the lamps is to choose a 
positive divisor d of n, with d < n, and change the status of 7 equally spaced lamps 
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(i.e., from ON to OFF or vice-versa), provided all of them have the same status. Does 
there exist a sequence of allowed operations that leaves all of the n lamps ON? 


Solution No! By contradiction, assume, without loss of generality, that the circle is 
the unit circle in the complex plane and the lamps are positioned at the points 1, , 
..,@"—!, where w = cis an Assume also (again with no loss of generality) that 
the lamp which is initially ON is the one standing at 1. 
For every integer k > 0, let a, denote the sum of the complex numbers associated 
to the lamps which are ON immediately before the (k + 1)-th operation is performed, 
so that ag = 1. Along the (k + 1)-th operation, we choose lamps positioned at 


we 
ol, oft4, oft? git G-Nd, 


for some pair (/, d) of integers such that 0 < / < dandd | n, withd < n. Since 
aw" = I, we have 
Ak+1 = ap Col eal gel neg wit (a-)4y 


ol tG-Dd . od — ol 


wt —1 


=aun = dk, 
and it follows that a, = 1 for every k > 0. 
On the other hand, if after m operations all lamps were ON, then we would have 


am =1+o@+---+o0"=0, 


which is a contradiction. oO 


Now, given m,n, p € N, let us consider the problem of assembling an m x n 
rectangle by using rectangular pieces | x p. Counting 1 x | squares, we obtain 
an obvious necessary condition for the proposed assembling to be possible: p must 
divide mn. On the other hand, it is clear that if p divides either m or n, then the 
assembling is always possible: letting, for instance, m = pk, withk €N, it suffices 
to juxtapose k rectangles p x n, each of which assembled by piling 1 pieces 1 x p, 
one on the top of the other. The interesting fact here is that the converse of the 
above discussion holds true, namely: one can assemble the desired rectangle only 
if p divides m or n. This is the content of the coming theorem, which is due to the 
American mathematician David Klarner.> 


Theorem 13.21 (Klarner) Let m,n and p be given naturals. If we can assemble 
anm x n rectangle by using pieces | x p, then p | mor p|n. 


5Cf. Klarner and Gébel [24]. Nevertheless, the proof we present here is different from the original 
one. 
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Proof Assume we can perform such a task, and note first that p < max{m, n}. Now, 
partition the rectangle into m rows 1 x n and n columns m x 1, numbering the rows 
from 1 to m from top to bottom, and the columns from | to 7 from left to right (as 
is usual in an m x n matrix). Then, inside the 1 x | square of intersection of row i 
and column j, write the complex number w't/, where w = cis 2m 

Let us now compute the sum of all of the numbers written in the 1 x 1 squares in 
two different ways (here we have a double counting!). On the one hand, such a sum 


is clearly equal to 


w(w" _ l(a” —1) 
@=1) 


where we have used Lemma 13.18 in the equality above; on the other, since we 
are assuming that it is possible to assemble the rectangle as wished, the numbers 
written on it can be grouped together in sets of p numbers each, corresponding to 
the p horizontal or vertical 1 x 1 squares forming each one of the | x p rectangles 
used. Such sets of p complex numbers give rise to sums of one of the following 


types: 
with 4 gi tGtD 4... 4 @itlte-D 
or 


wits a oitbD+s Sie cet witP—D+i 


according to whether the | x p rectangle is horizontally or vertically positioned, 
respectively. Again by Lemma 13.18, together with the first de Moivre’s formula, 
each of such sums equals 


oe sudo GP sl 
PUG wharke Vogts = ; —0 
o— 


Hence, the sum of all of the 7m numbers written in the rectangle must also vanish. 
Therefore, it follows from (13.22) that (w” — 1)(@” — 1) = 0 or, which is the same, 
that 


_ 2nT _ 2mm 
cis —— = 1 or cis =]. 
P P 
Finally, it is clear from these equalities that p | n or p | m. Oo 


We devote the rest of this section to take a closer look at the structure of the set 
of n-th roots of unity. We start by letting U,,(C) denote it, so that 


Un(C) = {ze C; 22 = 1} = (ok; 1 <k <n}, 


n? 


20 


where wy = cis =. 
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We begin by showing that the intersection of U,,(C) and U,(C) is again a set of 
roots of unity. 


Proposition 13.22. Given m,n € N, one has 

Um(C) A Un(C) = Ua(C), 
where d = gcd(m, n). In particular, ifm and n are relatively prime, then Up(C) 
Un(C) = {1}. 


Proof The second part follows immediately from the first. For the first part, if z € 
Ua(C), then z4 = | and, hence, z” = (z4ym/a = 1. Analogously, z” = 1, so 
that z € U,,(C) NO U,(C). This reasoning shows that Ug(C) C Um(C) NU, (C). 
Conversely, let z € Um(C) M U,(C). Bézout’s theorem assures the existence of 
x,y € Zsuch that mx + ny = d. Therefore, since z” = z” = 1, we get 


2 = zmx-tny = (2™)* (2) =< 


so that z € Ug(C). Hence, Un, (C) N Un (C) C Ug(C), and we are done. oO 


We now specialize to a particular type of root of unity, according to the definition 
below. 


Definition 13.23, A complex number z € U;,(C) is a primitive n-th root of unity 
if 


Un(C) = {z'"; m € N}. 


It follows immediately from the definition that w, is a primitive n-th root of unity. 
On the other hand, the following result completely characterizes all of such special 
roots. 


Theorem 13.24 Given z € U,,(C), the following are equivalent: 


(a) z is primitive. 
(b) There exists an integer 1 < k <n such that gcd(k,n) = 1 andz= ok, 
(c) min{fl e N; 2 =1j=n. 


In particular, there are exactly y(n) primitive n-th roots of unity, where py : N > N 
stands for the Euler function. 


Proof 
(a)>(b): letz = ok, for some integer 1 < k <n. If gcd(k,n) = d > 1, then 
n = dn’ andk = dk’, for some k’,n’ € N. Also, 


3k } 


Pn i 


{z/; JEN= (ok, w* wo 
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Letting m = n'q +r, withg,r € Zand0 <r <n’, we have 


/ dk! k’ 
ot = o” qtr)dk _ wt eal 


_ ork k (n'—)k 
i =o, €{l,@,,...,@, }. 


Therefore, 
i . P n 
#{z4; j eN}<n =< 
(b)=>(c): letz = ok, where | < k < n is an integer relatively prime with n. 
Take / € N for which z! = 1. Then, ok! = | and, thus, n | kl. However, since 
gcd(k,n) = 1, item (a) of Proposition 6.22 assures that n | J; in turn, this gives 
1 > n. On the other hand, since z” = (ok y" = (al)k = 1, we conclude that 
min{/ € N; z! = 1l}=n. 


(c) => (a): assuming that min{/ € N; z! = 1} =n, we first show that, for integers 
l<r<s <n, we have z’ $ z’. Indeed, if we had z’ = z*, then we would have 
z-" = 1, with 1 < s —r <n; but this would contradict the assumed minimality 
of n. It follows from this claim that z, 27, ... , 2” are pairwise distinct n-th roots 
of unity, so that {z/; 7 ¢ N} = U,(C). Then, z is primitive. oO 


In the sequel, we show that primitive roots of unity provide another way to 
establish the multiplicative character of the Euler function. To this end, we need 
a preliminary result which is interesting in itself. 


Theorem 13.25 Let z and w be primitive roots of unity, of orders m and n, 
respectively. If gcd(m,n) = 1, then zw is a primitive root of unity of order mn. 


Proof Since z” = w" = 1, we have (zw)’"” = (z’")"(w")” = 1, so that zw € 
Umn(C). Now, take / € N such that (zw)! = 1. Then, 


z= (wl)! € Um(C) A Un(C) = {1}, 
where we used Proposition 13.22 in the last passage above. Therefore, z/ = 
(wy = 1. 

Now, the previous theorem, together with the fact that z and w~! = w”~! are 
primitive roots of unity of orders m and n, respectively, guarantees that / | m and 
1 | n. However, since gcd(m,n) = 1, item (e) of Proposition 6.22 gives mn | I. 
Hence, / > mn, so that the equivalences of the previous theorem imply that zw is a 
primitive root of unity of order mn. oO 


For our final result, we let U,(C)* denote the subset of U,,(C) formed by the 
primitive roots of unity of order n: 


U,(C%* = {wk; 1<k <n and ged(k,n) = 1). 
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Theorem 13.26 [fm and n are relatively prime natural numbers, then the map 


Um(CY* x Un(CY —> Umn (Cy 
(z, w) > zw 


is a well defined bijection. In particular, p(mn) = g(m)g(n), where gy :N > N 
stands for the Euler function. 


Proof The well definiteness of the given map follows immediately from the 
previous result. On the other hand, if we show that it is a bijection, then two 
applications of the last part of Theorem 13.24 will furnish 

(mn) = |Umn(C)*| = |Um(C)* x Un(C*| 


= |Um(C)*| - |Un(C)*| = gvm)eca). 
We now show what is left, in two steps: 
(i) The map is injective: take zj,z2 € Um(C)* and w1, w2 € U,(C)* such that 
Z1W 1 = Z2W2. Since gcd(m, n) = 1, we have 
zy lz1 = wow)! € Um(C) OM Un(C) = {1}. 


Therefore, Zp 1 = wow)! = 1, which is the same as z; = Z2 and w; = w2. 


(ii) The map is surjective: let 1 < k < mn be an integer such that gcd(k, mn) = 1. 
We wish to find nonzero integers x and y such that gcd(x, n) = gcd(y, m) = 1 
and, letting z = @m, w = w*, we have zw = wk, (why does this suffice?). To 
this end, we must have 


. 2(mx +ny)r _ Qk 
cis ——————— = cis —_,, 


mn mn 


which happens if mx-++ny = k. However, since gcd(m, n) = 1, Proposition 6.26 
assures that such integers x and y do exist. Oo 


We shall have more to say on the combinatorial and algebraic aspects of roots of 
unity in Sects. 14.1 and 20.2, respectively. 


Problems: Sect. 13.3 


1. Give an example of an infinite set 7 C C satisfying the following conditions: 


(a) For each z € 7, there exists n € N such that z” = 1. 
(b) For all z,w € 7, wehave zweT. 
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2. (Croatia) Let n be a given natural number and A be a set of n nonzero 
complex numbers having the following property: if any two of its elements (not 
necessarily distinct) are multiplied, we obtain another element of it as result. Find 
all such possible sets A. 

3. * Show that, ifn > 2, then the product of the primitive n-th roots of unity is 
equal to 1. 

4. A rectangle is assembled by using a combination of pieces 1 x m or 1 x n, where 
m and n are given natural numbers. 


(a) If d = gcd(m, n), prove that d divides at least one of the dimensions of the 
rectangle. 

(b) Prove that there exist arbitrarily large rectangles that not necessarily can be 
assembled by using only pieces 1 x m or only pieces | x n. 


5. We cover an 8 x 8 chessboard using twenty two pieces, with twenty one of them 
being | x 3 rectangles and one being an | x 1 square. Find all possible positions 
of the | x 1 square. 

6. Prove the following extension of Klarner’s theorem: given natural numbers m, 
n, p and q such that 1 < q < min{m,n, p}, we can assemble an m x n x p 
parallelepiped by using pieces | x 1 x q if and only if g divides m, n or p. 

7. We can further generalize Klarner’s theorem as follows: given m,,...,m™n € N, 
let 


A=Im, X++:X In,» 


be the set of all sequences (x1,...,X,) of positive integers such that 1 < x, < 
mg forl < k <n. A(,...,1, p)-block in A is a subset of A formed by p 
sequences (X11,.--,X1n),-++> (Xpi,-+++Xpn) Satisfying the following condition: 


there exists an integer | < k <n such that: 


(a) x17 =X2j = +++ =Xpj foreveryl < jn, j Fk. 
(b) (1K, X2k, ++, Xpx) iS a Sequence of p consecutive integers. 
We say that A can be partitioned into (1,..., 1, p)-blocks if A can be written 


as the disjoint union of such blocks. Prove that this is possible if and only if p 
divides one of the numbers m1, ..., mp. 

8. (Russia) We wish to partition the set of four-digit natural numbers into subsets of 
four numbers each, in such a way that the following property is satisfied: the four 
number that compose each subset have the same digits in some three decimal 
places, whereas, in the fourth place, their digits are consecutive (for instance, a 
possibility for the four numbers of a subset could be 1265, 1275, 1285 and 1295). 
Prove that there does not exist such a partition. 


Chapter 14 M®) 
Polynomials spooks 


From a more algebraic point of view, real polynomial functions can be seen as 
polynomials with real coefficients. As we shall see from this chapter on, such a 
change of perspective turns out to be quite fruitful, so much that we shall not restrict 
ourselves to polynomials with real, or even complex, coefficients; later chapters will 
deal with the case of polynomials with coefficients in Z,, for some prime number 
p. As aresult of such generality, we will be able to prove several results on Number 
Theory which would otherwise remain unaccessible. Our purpose in this chapter 
is, thus, to start this journey by developing the most elementary algebraic concepts 
and results on polynomials. To this end, along all that follows we shall write K to 
denote one of Q, R or C, whenever a specific choice of one of these number sets is 
immaterial. 


14.1 Basic Definitions and Properties 


In this section we collect the basic definitions and notations on polynomials which 
will be needed to the further developments of the theory. 


Definition 14.1 A sequence (ao, a1, 42,...) of elements of K is said to vanish 
almost everywhere if there exists an integer n > —1 such that 


Ant = An+2 = An43 = +++ = 0. 


In words, the sequence (ao, a1, a2, ...) vanishes almost everywhere if all of its 
terms, from a certain index on, are all equal to zero. For instance, both sequences 


(0, 0,0,0,0,0,...) and (1, 2,3,...,7,0,0,0,...) 
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vanish almost everywhere; on the other hand, the sequence (1, 0, 1,0, 1,...), with 
1’s and 0’s indefinitely alternating themselves, obviously does not vanish almost 
everywhere. 


Definition 14.2 A polynomial on (or with coefficients in) K is a formal sum 


f(X) = ao + aX + aX? + aX? +++: Ya X*, (14.1) 
k>0 


where (ao, a1, 42, ...) is a sequence of elements of K vanishing almost everywhere 
and we make the convention that X° = 1 and X! = X in the above sum. Whenever 
there is no danger of confusion, we shall simply write f, instead of f(X). 


Yet with respect to the above definition, it is worth noticing that the choice 
of X is completely immaterial; in particular, it does not represent a variable, and 
could have been substituted by whichever symbol we wanted, say LU, for instance. 
Also, the correct interpretation of the word formal is that two polynomials f(X) = 
Vso UX* and g(X) = p59 b,X* on K are equal if and only if ax = by for 
every k > 0. 7 

Given a polynomial f(X) = a9 +a,X + ay X* + a3X32+--- on K, we shall 
adopt the following conventions: 


i. The elements a; € K are called the coefficients of /f. 

ii. If a; = 0, we shall omit the term a;X‘ whenever convenient. In particular, 
since the sequence (ao, a), a2, ...) vanishes almost everywhere, there exists an 
integer n > O such that we can write 


FCOS DS. aX. 
k=0 


iii. If aj = +1, we shall write +X’ in place of (+1)X’, for the corresponding term 
of f. 

iv. The polynomial 0+0X +0X?+- -- is said to be the identically zero polynomial 
on K. We shall also sometimes say that such a polynomial vanishes identically. 
Furthermore, whenever there is no danger of confusion with 0 € K, we shall 
write 0 to denote the identically zero polynomial on K. 

v. More generally (and in consonance with items ii. and iv.), given a € K we shall 
denote the polynomial w + 0X + 0X* +--- simply by a, and say that it is the 
constant polynomial a. In each case, we shall always rely on the context to 
make it clear whether we are referring to the constant polynomial @ or to the 
element a € K. 


We let K[X] denote the set of all polynomials on K. In particular, in view of item 
v. above, we also make the convention that 


K Cc R[X]. 
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On the other hand, the inclusions Q Cc R C C furnish the corresponding inclusions 
QLX] c R[X] c C[X]. 


Example 14.3 If f(X) = 1+ X — X34 2X’, then f ¢ Q[X] (for V2 ¢ Q) but 
f € R[X]. On the other hand, the formal sum g = 1+X+X?*+X?+X++..- does 
not represent a polynomial, for the sequence (1, 1, 1,...) does not vanish almost 
everywhere. 


In what follows, we shall use the ordinary operations of addition and multiplica- 
tion on K to define on K[X] operations 
@® : KLX] x K[X] — K[X] and ©: K[X] x K[X] > KX], 
also called addition and multiplication, respectively. To this end, we first need the 
following auxiliary result. 


Lemma 14.4 [f (ax)x>0 and (bx) x>0 are sequences of elements of 1K that vanish 
almost everywhere, then so do the sequences (ax + bx)x>0 and (cx)K>0, where 


k 
Ck = > ajbj = So ajbk-i- 
i+j=k i=0 
i,j=0 

Proof We show that the sequence (cx)x>0 vanishes almost everywhere, leaving the 
other case as an exercise for the reader (cf. Problem 1). Let m,n € Z+ be such that 
qa, = Ofori > nandb; = Ofor j > m.Ifk >m-+nandi+ j =k, withi, j = 0, 
then either i > n or j > m, for otherwise we would have k = i+ j <n-+m, which 
is not the case. However, since i > n > a; = Oand j > m > b; = 0, in any event 
we have ajb; = 0, so that 


Ck= > ab; = 0. 
itj=k 
i,j20 


oO 


With the previous lemma at our disposal, we can finally formulate the definitions 
of addition and multiplication for polynomials. 


Definition 14.5 Given in K[X] polynomials 


F(X) = Sha X* and g(X) = DEX, 


k>=0 k>0 


350 14 Polynomials 


the sum and product of f and g, respectively denoted by f @ g and f © g, are the 
polynomials 


(f ® g)(X) = D(ay + by) X* 


k>0 


and 


(f Og)(X) = doce X*, 


k>0 


with cz, = Vitjne ajb;. 
i,j20 
Even though the definition of the product of two polynomials may not seen to be 
a natural one at a first glance, this is not the case. More precisely, the formula for 
the coefficient c, of f © g is the correct one if we wish © to be distributive with 
respect to ®, as well as the usual rule for powers, X” © X" = X"*", to be valid. 
Indeed, if such properties hold, then computing the product 


(ap + aX +a2X* +--+) © (bo + bX +X? +--+) 


distributively, we obtain 


aobo = > ab; 


i+j=0 
i,j2=0 


for the coefficient of X°, 


agbi + abo = > ajb; 
i+j=l 
i,j=0 
for the coefficient of X, 


agb2 + ab, + azbp = > ajbj; 


i+j=2 
i,j2=0 


for the coefficient of X* and so on. 

Actually, it is not difficult to convince ourselves (cf. Problem 4) that, as 
defined, the operations of addition and multiplication of polynomials on K have 
the following properties: 


i. commutativity: f6g=g@6fandfOg=gOf; 
ii. associativity: (f ®g)@h=f @(g @h) and(fOg)Oh=fO(gOh); 
iii. distributivity: f © (g @h) =(f Og) @(f OA), 


for all f, g,h € K[X]. 
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Example 14.6 Consider the polynomials with real coefficients f(X) = 1+ X — 
J/2X* — 4X3 and g(X) = X + X?, where, according to our previous conventions, 
we omit terms with vanishing coefficients. Then, 


(FOeCO =14 2X4 (0 =0/2)x* 47 
and 


(f © g(X) = (14+ X — V2xX? — 4x7) © (X + X”) 
=[1O(X+ X*)]@[XO(X+ Xe 
@ [=V 2X" OK kX] (4X) 
= (X + X*) @ (X? + X?) @ (-V2X3 — V2x@ 
® (—4X* — 4x?) 
=X +2x?+(1—V2)X? — (V2 +.4)x4 —4Xx°. 


Our discussion up to this point makes it clear that we can relax notations, 
henceforth writing simply + and - to denote the operations of addition and 
multiplication of polynomials. Thus, whenever we add two polynomials, the reader 
must be aware of the fact that the involved + signs stand for two distinct operations: 
addition of elements of K, performed on the coefficients of the polynomials, and 
addition of elements of K[X]. Nevertheless, this will cause no confusion, for the 
context will always make it clear which operation the + sign refers to. Moreover, an 
analogous comment holds for the - sign and the two operations of multiplication. 


Remark 14.7 All of the above definitions and remarks can be extended, in an 
obvious way, to polynomials with integer coefficients. From now on, whenever 
necessary, we shall denote the set of such polynomials by Z[X]. 


Letting 0 be the identically vanishing polynomial, we have f +0 =0+ f =0 
for every f € K[X], ie, 0 is the identity for addition of polynomials. On the 
other hand, given f € K[X], there exists a single g € K[X] for which f + g = 
g+ f = 0; indeed, letting f(X) = a9 +a,X + a)X? +.---, it is immediate that 
g(X) = —ay — a, X — aX 2... is such polynomial, which will henceforth be 
denoted by — f. Thus, 


(— f)(X) = —ay — a, X — aX? 


and, for f, g € K[X], we can define the difference ( — g between f and g by 
ey aoe Sead es 8 
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For a € K and g(X) = bop +b, X +b) X*+--- € K[X], it is immediate to verify 
that 


a-g=aby +ab)X +ab.X*+---; 


in particular, we have 1 - g = g for every g € K[X], so that the constant polynomial 
1 is the identity for multiplication of polynomials. 

From now on, whenever there is no danger of confusion, we shall write simply 
fg, instead of f - g, to denote the product of f, g € K[X]. In particular, if a € K, 
then af will stand for the product of a, seen as a constant polynomial, and /f. 

The coming definition will play a major role along the rest of the book. 


Definition 14.8 Given f(X) = ag +a,X +---+a,X" € K[X] \ {0}, with a, 4 0, 
the nonnegative integer n is said to be the degree of f, and we will write 0f =n 
to denote this. 


According to the above definition, the notion of degree only applies to nonva- 
nishing polynomials. On the other hand, df = 0 for every constant polynomial 
Ff (X) = a, with aw € K \ {0}. From now on, whenever we refer to f € K[X] \ {0} 
by writing 


f (X) =a, X" +---+a,X +40, 


unless explicitly stated otherwise we shall tacitly assume that a, #~ 0. In such a 
case, a, Will be called the leading coefficient of f. Also, f € K[X] \ {0} is said to 
be monic if it has leading coefficient equal to 1. Finally, in the above notations apg is 
said to be the constant term of /. 

The coming proposition establishes two very important properties of the notion 
of degree of polynomials. 


Proposition 14.9 For f, g © K[X] \ {0}, we have: 


(a) 0(f +g) < max{df, dg} if f+ eg 40. 
(b) fg #4 Oand 0( fg) = of + dg. 


Proof Let 0f =n and dg = m, with 


f(X) = ay +ayX +--+ +ayX" and g(X) = bo +b, X +--+ + by X™. 


(a) Ifm ~n, we can assume, without any loss of generality, that m > n. Then 
(f + g)(X) = do + bo) ++++ + (Qn + by) X" + nee ae +++ + bn X", 
so that 0(f + g) =m = max({df, dg}. If m =n but f + g #0, then 


(f + 8)(X) = (ao + bo) + +++ + (Gn + bn) X", 
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and there are two possibilities: a, + by, = 0 or a, + b, ¥ 0. In the first case, 
o(f +g) <n=max({0f, dg}. In the second, 0(f + g) =n = max{df, dg}. In 
any case, we have 0(f + g) < max{df, dg}. 

(b) Let fg =cotcX + c7X? +---. If k > m+n, we saw in the proof of 
Lemma 14.4 that c, = 0. Hence, if we show that cm+, 4 0, we will conclude 
that fg A Oand 0( fg) = m+n = Of + dg. For what is left to do, since a; = 0 
fori > n and b; = 0 for j > m, it is immediate that 


Cnn >) aby = Gahan Z 0. 


itj= m+n 
i,j>0 


oO 


In spite of the fact that the material developed so far is rather elementary, it is 
enough to present yet another use of roots of unity as tags (cf. Sect. 13.3). To this 
end, given f(X) = a,X" +---+a,X + ao € C[X] and z € C, it is convenient that 
we write f(z) to denote the complex number 


f(Z) =anz" +--+ +a1z+ a0. (14.2) 


(We shall have more to say on this substitution of X by z in Sect. 15.1—cf. 
Definition 15.1.) 

Formula (14.3) below is known in mathematical literature as the multisection 
formula. 


Theorem 14.10 For f € C[X]\ {0}, let ay denote the coefficient of X* in f. If n is 
a natural number, w = cis te and z € C, then 


1 
y on" = SFG) f@n 4st fo" |2)). (14.3) 


n|k 
Proof Since f(X) = Dixs0 a;X*, we have 
n—1 


pe Sad 25 yaa i 


j=0 k>0 k>0 j=0 


= = (axz! ve) 


k>0 


(14.4) 


Now, note that if n | k then w/* = 1 for every j € Z, so that pas ~j ork =n.On 
the other hand, if n { k, then ok | and, hence, 


n-1 


nk 
ial = 140k... -@"-DE = 2 = = 0. 
om ok — 1 
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Therefore, (14.4) reduces to 


n-1 


» f(w! Z= > naz’. 


n|k 
oO 


The coming example relies on the fact that, given a, b € K, the binomial formula 
can be used to expand (aX + b)” € K[X] in monomials. The reader is invited to 
check this. 


Example 14.11 (Poland) For each n € N, compute, in terms of n, the value of the 
sum 


Solution Let 


f®M=K+Y"' => (;) 


k=0 


If @ = cis 2, then w is a cubic root of unity and 1 + w + w = 0. By applying the 
mulesection Posts to f, we get 


> (i) = = + (fa) + f+ Fo) 
3k k 3 


1 
=; (2+ @+"+@?+0") 
1 n 2\n n 
= 5 (2" + 0%)" + a) ) 
1 
=o (2" + i'w" + o")) . 
Now, note that if 3 | n, then w” = | and, hence, wo" + ao" = 2; if 3 { n, then 
ow" =o or w”, so that o2” + w" = w2 + @ = —1. Therefore, 
3 n\ — | (2"+2(-1)")/3, if3|n 
ac NE ~ [a+ p"*)/3, if 3¢n 7 
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1. * If (ag)e>o and (bx)x>0 are almost everywhere vanishing sequences of 
elements of KK, show that the sequences (ag, + bx)x>0 also vanish almost 
everywhere. 

2. * Givenn € N anda € K, perform the following products of polynomials: 


(a) (X —a)(X""! +ax"-2 4... 4a"? X +a"~!). 
(b) (Xx + a)(X"—! = axn-2 peer qa"-2X + a’—}), if n is odd. 
(OF Ce DO a)... ce Ee , 


3. Compute the coefficient of X? on both sides of (X+1)"(X+1)" = (X¥4+1)™"™ 
to give another proof of Vandermonde’s identity! (cf. Problem 3, page 48): 


E(t) = ("5") 


with the sum at the left hand side above ranging through all possible nonnega- 
tive integer values of k. 

4. * Show that the operations of addition and multiplication of polynomials are 
commutative and associative, and that multiplication is distributive with respect 
to addition. 

5. * Show that K[X] does not possess any zero divisors. More precisely, show that 
if f, g € K[X] are such that fg = 0, then f = Oor g = 0. 

6. (TT) Find at least one polynomial f, of degree 2001, such that f(X) + fd — 
X)=1. 

7. Letn € N and, for 0 < k < 2n, let ag be the coefficient of X* in the expansion 
of f(X) = (1+ X + X’)". Compute, in terms of n, the value of the sum 


a3+da6+dagt+-:-:-. 


8. * Generalize the multisection formula in the following way’: given f € C[X]\ 
21 


{0}, a natural number n > | and an integerO <r <n—1,if w= cis a and 
z€C, then 
1 n—1 
ke —rj J 
az = o @’Z), 
Tata tYorfoln 
k=r (mod n) j=0 


where a, denotes the coefficient of X* in f. 


' Alexandre-Thedphile Vandermonde, French mathematician of the eighteenth century. 


?For another proof of this result, see Example 18.7. 
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9. Compute, as a function of m € N, the value of the sum 


(Seas) omg) a) 


For the coming problem, given f(X) = Gn X" an) X"—!4---+a,X+ag € 
K[X] and m € N, we write f(X”’) to denote the polynomial in K[X] defined 
by 


Bg MO A Gp] RO tra ge RY he gy 


i.e., obtained by writing X” in place of X in the expression of f. 

10. (Miklé6s-Schweitzer) A polynomial with real coefficients is called positively 
reducible if it can be written as the product of two nonconstant polynomials 
whose nonvanishing coefficients are positive. Let f € IR[X] have nonvanishing 
constant term. If f(X") is positively reducible for some natural number m, 
prove that f itself is positively reducible. 


14.2. The Division Algorithm 


We have already seen that it is possible to add, subtract and multiply polynomials, 
the result being another polynomial. And what about the possibility of dividing one 
polynomial for another? Well, it will not always be possible to do so; in other words, 
given polynomials f, g € K[X] \ {0}, there will not always exist h € K[X] such 
that f = gh; for instance, if df < dg, then such a polynomial h does not exist, for 
otherwise we would have 


af = 0(gh) = dg + dh > Og, 
which is an absurd. 


In spite of the above discussion, the following analogue of the division algorithm 
for integers, called the division algorithm for polynomials, still holds. 


Theorem 14.12 /f f, g € K[X], with g 4 0, then there exist unique q,r € K[X] 
such that 


f =gq+tr, with r =0 or dr < dg. (14.5) 


Proof Let us first show that there exists at most one pair of polynomials g, r € K[X] 
satisfying the stated conditions. To this end, let q1, g2, 71, r2 € K[X] be such that 


f=gent+n =gqt+nre, 
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withr; = Oor0O < or; < 0g, fori = 1, 2. Then, g(g1—q2) = r2—1r; and, if gq; 4 q2, 
Problem 5, page 355, would guarantee that r; # r2. However, Proposition 14.9 
would then give 


dg < 0g + O(q1 — 42) = O(g(G1 — 42)) 


= 0(r] —r2) < max{dr, 0r2} < dg, 


which is an absurd. Hence, g; = q2 and, then, r) = r2. 
Let us now establish the existence of polynomials g and r satisfying the stated 
conditions. To this end, we consider two cases separately: 


(i) If f = 0, take g = r = 0; if g is (a nonzero) constant, say c, take g = c! f 
andr = 0. 
(ii) If f # 0 and g is nonconstant, we shall make induction on df > 0. 


As initial case, if 0f < dg (such an f does exist, for dg > 1), take g = 0 and 
r = f. By induction hypothesis, let m > dg be an integer and assume that the 
existence of q and r has been established for any polynomial of degree less than m 
in place of f. In order to establish the induction step, take f € K[X] \ {0} of degree 
m, and let b ¢ 0 denote the leading coefficient and n the degree of g. If a stands for 
the leading coefficient of f, there are two distinct possibilities: 


© f(X) = ab-!X™-"g(X): in this case, g(X) = ab-!X™~" and r(X) = 0. 
© f(X) # ab~!X™—"g(X): letting h(X) = f(X) — ab~!X"~"g(X), we have 
h #Oand 


ROO) S(aX 425.) Sab 1X +), 


so that dh < m. Therefore, by induction hypothesis, there exists gj, r1 € K[X] 
such that h = gq; +11, withr; = Oor dr, < 0g. Thus, 
F(X) = W(X) + ab“! X"™—" g(X) 
= g(X)qi(X) + 11(X) + ab“! X™—-" 9(X) 
= 9(X)(qi(X) + ab! X"™") + 1(X), 
and it suffices to take g(X) = q1(X) + ab~!X"~ and r(X) =r, (X). 
oO 


In the notations of the division algorithm for polynomials (and by analogy with 
the division algorithm for integers), we say that q is the quotient and r is the 
remainder of the division of f by g. Moreover, when r = 0 we say that f is 
divisible by g, or that g divides f; in this case, as with integers we denote g | f. 


Remark 14.13 As the reader can easily verify by checking the previous proof, 
the division algorithm still holds for polynomials in Z[X], provided the leading 
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coefficient of g is equal to +1. More precisely, if f, g € Z[X], with g 4 0 having 
leading coefficient equal to +1, then there exists unique polynomials g,r € Z[X] 
such that 


f =gqtr, with r =0 or Or < dg. 


In all that follows, we shall use this without further comments. 


In spite of the name division algorithm, the existence part of the presented proof 
for Theorem 14.12 is actually not algorithmic, i.e., it does not provide an actual 
algorithm for finding g and r out of f and g. In order to remedy this, we now revisit 
the existence part of the proof in a more algorithmic way: 


Division Algorithm for Polynomials 


1. DO 


* r<—fim<dfsq <0; 
° a <— LEADING COEFFICIENT OF Fr; 


2. WHILEr #0 OR dr > 0g DO 


© r(X) — r(X) — ab-!X"-"9(X); 
© g(X) <— q(X) +ab7! x"; 

* m<+or 

° a <— LEADING COEFFICIENT OFF; 


3. READ THE FINAL VALUES OF r AND q. 


Let us prove that the above algorithm does stop after a finite number of repetitions 
of the loop WHILE and give us, at the end, polynomials g and r as wished. 

Ifr ~ Oor Or > Og, then the instruction of the WHILE loop substitutes r(X) by 
the polynomial r(X) — ab~!X"~"g(X). Such a polynomial, if it is not identically 
zero, has degree less than that of r, for the polynomial ab~! X"”~" g(X) has degree m 
(which is precisely the degree of r immediately before the execution of the loop) and 
leading coefficient a (which is the leading coefficient of r right before the execution 
of the loop). Thus, the algorithm stops after a finite number of steps. 

After the first attribution, we have 


F(X) = g(X)q(X) + r(X), 


for, at the beginning, g(X) = O and r(X) = /f(X). Assume, by induction 
hypothesis, that after a certain execution of the WHILE loop we have f(X) = 
g(X)q(X) + r(X) and that, at this moment, r(X) 4 O and dr > dg. Then, the 
next step does take place and replaces r(X) by r(X) — ab~!X"~"g(X), and q(X) 
by g(X) +.ab7!X"—", Now, since 


g(X) (q(X) + ab-1X"™—") + (r(X) = ab“!X™—"g(X)) (14.6) 


14.2 The Division Algorithm 359 


equals g(X)qg(X) + r(X), it follows from induction hypothesis that, after such 
additional execution, we will also have (14.6) equal to f(X). 


Example 14.14 Follow the algorithm presented above to obtain the quotient and the 
remainder upon dividing f(X) = X*—2X?+5X +7 by g(X) = 3X7*+1 in Q[X]. 


Solution Following the notations of the previous discussion, we start by storing 
b =3 and n = 2. Then, we have the following steps: 


1. We make the attributions r(X) = X+—2X*+5X +7;m = 4; q(X) =0;a=1. 
2. Since neither r(X) = 0 norm = 4 < n = 2, we substitute r(X) by 


ry ab XS" Fey 


1 
a aes ars eo xP GX? +1) 
7 2 
m by 2, q(X) by 
—lym-n 1 4-2 1 2 
q(X) +ab"!X =0+5x = ak 


and a by -f. 
3. Since we still do not have r(X) = 0 or m = 2 < n = 2, we replace r(X) by 


r(X) — ab! X"™-" 9(X) = 
7 7/3 
= = 0 Loe 474. He x?2ax? +1) 


70 
=5X+_, 
- 9 
m by 1, q(X) by 


gx\ab x" = 


and a by ;- 
4. Now, although r(X) 4 0, we have m = | < n = 2, so that we do not return to 
the beginning of the loop. Instead, we simply read 


(X) =5X+ ue d q(X) lye i 
= — an SAN See, 
' 9 2 3 9 
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The table below looks at the solution of the previous example schematically. In 
order to help the reader in grasping how steps |. to 4. actually took place, we suggest 
that he/she tries to identify each of those steps within the table. 


Division Algorithm for Polynomials 


X* 2K? 45K = -44-:7/3X7 41 


=x? -1x" 1/3X? 
—7/3X? +5X = +7|1/3X* —7/9 
q fox +7/9 
5X +70/9 


In the coming example, given f(X) = a,X"+---+a,X+ao € K[X] andz € K, 
we stick to the convention set in (14.2), of writing f(a) to denote the element 


f(@) =a" +---+aja+ag eR. 


As already quoted there, we refer to Sect. 15.1 for more on such substitution of X 
by a. 


Example 14.15 Given f(X) = a,X”" +---+a,X + a9 € K[X] anda € K, show 
that the remainder upon division of f(X) by X — a is precisely f(a). 


Proof We want to show that there exists g € K[X] such that 


f(X) = (X — a)g(X) + f(@). 
Since this is the same as having f(X) — f(a) = (X — a)q(X), we only need to 
show that X — a divides f(X) — f(a). We do this by invoking the result of item (a) 
of Problem 2, page 355, according to which 
X* — ok = (X — a) (XE 4x2 +... 4 2X + ath, 
for every k € N. Thus, 
n n n 
F(X) = F@) = Dy aX* — DV aga® = Yay (X* — a4) 


k=0 k=0 k=0 


n 
= So ag(X — a) (X*! $axk? +--+"? X 40%!) 
k=0 


n 
= (X -a) Va eg Oe acy 2X a, 
k=0 
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Problems: Sect. 14.2 


1. Given n € N, find the remainder upon division of (X* + X +1)” by X7—X+4+1. 

2. We are given natural numbers m and n, such that n is a divisor of m. Find the 
remainder upon division of X” + 1 by X” — 1. 

3. Given natural numbers m and n, with m > n, find the remainder of the division 
of X2" +1 by X”" +1. 

4. Leta, b € K be given, with a 4 0. For f € K[X] \ {0}, prove that the remainder 
of the division of f by aX + b is equal to f (- By. 

5. Upon dividing f € Q[X] by X +2, we obtain —1 as remainder; upon dividing it 
by X — 2, the remainder is 3. Find the remainder of the division of f by X? — 4. 

6. A polynomial f € R[X] leaves remainders 4 when divided by X + 1 and 2X +3 
when divided by X7+1. Find the remainder of the division of f by (X +1)(X?+ 
1). 

7. Let m and n be given natural numbers such that m = nq +r, with g,r € Z and 
0 <r <n. Find the quotient and the remainder upon dividing X” +1 by X"+1. 


Chapter 15 ®) 
Roots of Polynomials sei 


In all we have done so far concerning polynomials, at no moment we had any intent 
of substituting X by an element of K. Even in Theorem 14.10 and Example 14.15, 
the notation f(z), used to denote the complex number obtained by formally 
substituting X by z in the expression of f € C[X], was a mere convention. This 
is no surprise, for we are looking at polynomials as formal expressions, rather than 
as functions. In this sense, the indeterminate X is a symbol with no arithmetic 
meaning, and we have even stressed before that we could have used the symbol LU, 
instead. 

We remedy this state of things in this chapter, where we introduce the concept 
of polynomial function associated to a polynomial, as well as that of root of 
a polynomial. Among the various important results presented, we highlight the 
presentation of a complete proof of the Fundamental Theorem of Algebra, as well 
as the searching criterion for rational roots of polynomials with integer coefficients, 
along with several interesting applications. 


15.1 Roots of Polynomials 


In order to proceed with our study of polynomials, it is convenient to consider the 
polynomial function associated to a polynomial, according to the following 


Definition 15.1 For f(X) = a,X"” + “+++ aX + a9 € K[X], the polynomial 
function associated to f is the function f : K > K, given for x € K by 


ff) =anx" +--+ + a,x + a0. 
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When f(X) = c, note that the associated polynomial function 7 will be the constant 
function f(x) = c for every x € K, thus justifying the nomenclature constant 
attributed to a polynomial which is either identically zero or of degree zero. 


Definition 15.2 Let f € K[X] be a given polynomial, with associated polynomial 
function f : K > K. Anelement a € K isa root of f if f(a) = 0. 


For instance, if f(X) = ¥?41€C[X], itis easy to see that z = ++i are the 
only roots of f in C. Indeed, the polynomial function associated to f is 


f:Cc—C 
Ze +1’ 


so that 


~. 


f@=0e2=-1ez=Hi. 


The coming proposition is known as the root test. Note that item (a) revisits 
Example 14.15 in a slightly different way. 


Proposition 15.3 If f € K[X] \ {0} anda € K, then: 


(a) a isa root of f if and only if (X —a) | f(X) in K[X]. 

(b) If a is a root of f, then there exists a greatest natural number m such that 
(X —a)” divides f. Moreover, writing f (X) = (X —a)'"q(X), with € K[X], 
one has q(a) # 0, where q : K — K stands for the polynomial function 
associated to q. 

(c) If o1,...,a% € K are pairwise distinct roots of f, then f (X) is divisible by 
(X —ay)...(X — ag) in K[X]. 


Proof 


(a) The division algorithm assures the existence of polynomials g,r € K[X] such 
that 


f(X) = (X — a)g(X) +r (X), 
withr = Oor0 < dr < 0(X—a@) = 1. Hence, r(X) = c, aconstant polynomial. 


On the other hand, letting 7 and q be the polynomial functions respectively 
associated to f and q, it follows from the equality above that 


f(a) = (a —a)g(a) +c =, 
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ie., f(X) = (X — a)q(X) + f(a). Thus, 
aisarootof f > f(a) =0< f(X) = (X -—a)q(X). 


(b) Ifm > df, then (X — a)" { f(X), for A(X — a)” =m > df. Therefore, item 
(a) guarantees the existence of a greatest positive integer m for which (X —a)” | 
Ff (X), say f(X) = (X — a)" q(X), with g € K[X]. In the level of polynomial 
functions, we thus obtain 


Fx) = (x —a@)"G(x), Vx EK. 


Now, if g(@) = 0, it would follow from (a) that g(X) = (X — a)q1(X), for 
some g; € K[X]. However, we would then have 


F(X) = (KX — a) *"91(X), 


a contradiction to the maximality of m. 

(c) For the sake of simplicity, let us establish the result of this item for k = 2 (the 
proof in the general case following by a straightforward inductive argument). 
Since a; is a root of f, item (a) once more assures the existence of g € K[X] 
such that f(X) = (X — a1)g(X). Let g denote the corresponding polynomial 
function. Since a; 4 a2 and a7 is also a root of f, the last equality above gives 


O = f(a@2) = (a2 — a) g (a2), 


so that @2 is also a root of g. Thus, by applying (a) yet another time, we conclude 
that there exists h € K[X] for which g(X) = (X — a2)h(X). In turn, this gives 


S(X) = (X — a1) (X — a2)h(X). 


The coming example is an interesting application of the root test. 


Example 15.4 (Soviet Union) Label the rows and columns of ann x n chessboard 
from | to n, respectively from top to bottom and from left to right. Given 2n real 
numbers a,...,@n,51,..., bn, with b1, ..., by being pairwise distinct, write a; +b; 
inside the unit square situated in row i and column j, for 1 < i,j < n. If the 
products of the numbers written in the unit squares of each column of the chessboard 
are all equal, prove that the products of the numbers written in the unit squares of 
each row are all equal too. 
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Proof Consider the polynomial 
f(X) = (X + a1 )(X + a2)... (X + an) € RX]. 


Letting f be the polynomial function associated to f, it follows from the statement 
of the problem the existence of a € R such that f (bi) = f (b2) — f (bn) =a. 
Then, F (b;) —a = Ofor 1 <i <n and, since bj, ..., by are pairwise distinct, item 
(c) of the root test gives 


F(X) — aw = (X — bi )(X — bp)... (X — bn) 


(note that both f(X) — @ and the polynomial at the right hand side of the equality 
above are monic and have degree 1). Hence, 


a = f(—aj) — a = (—1)"(q@ + bi) (aj + b2)... (ai + bn), 
and this is the same as saying that the products of the numbers written in the unit 
squares of each row are all equal to (—1)"~!a. Oo 


As a consequence of the root test, we now describe a rather simple algorithm for 
finding the quotient of the division of a monic polynomial f € K[X] by X —a, with 
a € K being a root of f. Such an algorithm is known in mathematical literature as 
the Horner-Rufinni algorithm,! and is based on the following homonymous result. 


Proposition 15.5 (Horner-Rufinni) Let 
F(X) =X" + ay 1X" | +++ +a X +a 
be a monic polynomial on K. If a € K is a root of f and 
g(X) = X"-1 + by XK"? +... + bg € K[X] 
is the quotient upon dividing f (X) by X — a, then 


bn-2 = A+ An-1 
bn—3 = Abn—2 + Gn—2 
(15.1) 
by-i-) = Aby—ji + an—i 


bo =ab,+a, 


'Paolo Ruffini, Italian mathematician, and William G. Horner, English mathematician, both of the 
eighteenth and nineteenth centuries. 
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Proof By the sake of notational convenience, let b,_; = 1. Firstly, note that 


F(X) = (X — @)g(X) 
n—-1 


= (X —@) > ae 
i=0 


n—-1 


n—-1 
= ie _ Sobek 
i=0 f= 


n—1 n—2 
= by, X" + a Pps = oe: ae — abo 
i=l i=0 
n—1 n—-1 
= X"+ ree ae = chy; X" — abo 
i=1 i=1 


n—1 
=X" + DY On-1-1 — bn) X" — abo. 
i=1 


By comparing the last expression for f with that in the statement of the 
proposition, we conclude that 


by—1—i — bDy—i = An—i 


for 1 <i <n —1. This way, we obtain the linear system of equations 


bn-1 = 1 
by—2 — bby—| = An—1 


bn—3 — bbn—2 = An—2 


bn—i-1 — bn—i = An-i 


bo — abi =a 
—abo = ao. 


which, in turn, is readily seen to be equivalent to the equalities in (15.1) (note that 
the last equation of the system can be neglected, for it only represents the necessary 
compatibility condition for a to be a root of f). oO 


The table below, which can be seen as the actual execution of Horner-Ruffini’s 
algorithm, resumes the above discussion. Note that we put all of coefficients of f 
but the leading one (which equals 1) along the first row; then, in the second row 
we successively compute, from left to right, the coefficients of g, starting with the 


368 15 Roots of Polynomials 


leading coefficient b,_; = 1. Note that, starting with b;_2, each coefficient of g is 
equal to the product of the previously computed coefficient (i.e., the one at its left) 
by a, added to the coefficient of f situated right above this one. 


The Horner-Rufinni Algorithm 


an-1 an—-2 an-3 hips a\ ao 
by) = lia + 1 + apc» by—2 + n—2|-+ 0+ bg +aQ2, a+b; +44 
Se -_lo ee oO 


bn—2 bn—3 by bo 


The coming example shows the Horner-Ruffini algorithm in action. It also shows 
that the algorithm can be used to check whether some given element of K is actually 
a root of a given polynomial in K[X]. 


Example 15.6 Check that V2 is a root of f(X) = X°—5X*4+X3—5X?-6X+30 € 
R[X], and use the Horner-Ruffini algorithm to compute the quotient of the division 
of f by X — V2. 


Solution We begin assembling the numerical table corresponding to Horner- 
Rufinni’s algorithm by setting n = 5 and writing ag = —5, a3 = 1, a2 = —5S, 
a, = —6, ao = 30 along the first row: 


—5) 1 |—5|—6/30 


Now starting with b4 = 1 in the leftmost entry of second row, we successively 
obtain b3 = /2-1—5, by = /2b3 +1 = 3—5V2, by = V2b2 —5 = —15 432 
and by = /2b, — 6 = —15¥v2: 


5 1 5 —6 30 
f |f2 4 = $18 = 5/21 = 15-4 3/2 |= 15.72 


Finally, since —aby = —/2(—15./2) = 30 = ag, it follows that /2 is actually 
a root of f, and the quotient upon dividing f by X — V2 is X4 + (./2 — 5)X3 + 
B= 52)k* +615 pax = 152; Oo 


Back to the development of the theory and in the notations of the root test, if 
F(X) = (X¥ — a)" q(X), with g(a) F 0, we say that m is the multiplicity of a as a 
root of f. In particular, w is a simple root of f if m = 1, and a multiple root of f 
ifm > 1. 


Example 15.7 Let f(X) = X? — 7X? + 16X — 12 be a polynomial with real 
coefficients. It is immediate to check that f(X) = (X — 2)?(X — 3). Therefore, 
2 is a double root, whereas 3 is a simple one. 
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Later in this chapter we shall study in detail the problem of finding whether a 
given polynomial f € K[X] has multiple roots. For the time being, we only need 
to know what such a root is. Nevertheless, we shall observe that we already have a 
way of deciding whether an element aw € K is a multiple root of f € K[X] \ {0}: 
it suffices to check if f(@) = 0 and, if this is so, to divide f(X) by (X — a)? 
(which can be done by means of two successive applications of Horner-Rufinni’s 
algorithm), verifying whether or not such a division is exact. We shall have more to 
say about this in a while. 

Another interesting consequence of the division algorithm is the fact, due to 
Lagrange, that the number of roots of a nonzero polynomial cannot exceed its 
degree. In the coming result, note that we allow multiple roots. 


Corollary 15.8 (Lagrange) Jf f € K[X] \ {0}, then f has at most df roots in K, 


counted according to their multiplicities. 


Proof Let us make induction on the degree of f. If 0f = 0, then there exists c € 
K \ {0} such that f(X) = c. Hence, the polynomial function a associated to f is 
the constant function x +> c, so that f has 0 = df roots. 

Now, let f be a polynomial of positive degree, and assume that the result is 
true for every polynomial in K[X] \ {0} whose degree is less than that of f. If f 
has no roots in K, there is nothing left to do. Otherwise, let a € K be a root of 
f and (according to the root test) let m be the greatest natural number such that 
f(X) = (X — a)’"q(X), for some g € K[X] such that g(a) 4 0. Since 


dq = 0f —m < df, 


the induction hypothesis assures that g has at most 0g roots in K, counted with their 
multiplicities. 
In order to finish the argument, note that if 6 4 a is a root of f, then 


0= f(B) = (B—a)"G(B) 


and, hence, 6 is a root of g. Therefore, the number of roots of f (with multiplicities) 
equals m (the multiplicity of ) plus that of g. Thus, the induction hypothesis shows 
that f has at most 


m+ 0q = of 


roots in K. Oo 
The previous corollary will be frequently used in one of the two forms below. 


Corollary 15.9 If f(X) = a,X" +---+a,X +a9 € K[X] admits at leastn + 1 
distinct roots in K, then f vanishes identically, i.e., dn = ... = ag = 0. 
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Proof If f € K{X] \ {0}, then the former corollary assures it has at most n roots 
in K. Oo 


Corollary 15.10 Let f(X) =a,X"+---+a,X +o and g(X) = by X™ +-+-+ 
bi X + bo be polynomials in K[X], with m > n. If f(x) = g(x) for at least m+ 1 
distinct values of x € K, then f = g, i.e, m =n anda; = bj for0 <i <n. 


Proof It suffices to apply the previous corollary to the polynomial f — g, noticing 
that the associated polynomial function is f — g. Oo 


If Fix stands for the set of functions from K to itself, then, since K is infinite, the 
last result above guarantees that the mapping 


RL (15.2) 
ae A 

that associates to each f € K[X] its polynomial function f € Fx, is injective. In 

other words, it says that two polynomials on K will have equal polynomial functions 

if they are themselves equal polynomials.” Indeed, if f, g € K[X] are such that 

5 i = g, then f (x) = g(x) for infinitely many values of x, and the previous corollary 

assures that f = g. 

Thanks to the injectivity of the above map, from now one we shall write f to 
denote both an element of K[X] (i.e., a polynomial with coefficients in K) as its 
associated polynomial function. In particular, whenever we write f(X), we shall 
be referring ourselves to the polynomial f; whenever we write f(x), we shall be 
looking at the element of K, image of x € K through the polynomial function 
associated to f. Moreover, the context will always clear any possible doubts on this 
point. 


Remark 15.11 Corollary 15.10 assures that, thanks to the fact that K is infinite, the 
coefficients of f € K[X] are completely determined by the values f(x), with x € 
K. Later, in Chap. 18, we shall study in detail the problem of getting a polynomial 
in K[X] that assumes prescribed values at prescribed elements of K. 


The two coming examples illustrate typical uses of Corollaries 15.9 and 15.10. 


Example 15.12 (Moldavia) Find all polynomials f € R[X] such that f(0) = 0 and 


f@? +)D=/GY +1, Ve eR, 


7 As we shall see in Sect. 19.3, upon studying polynomials over Z p» for some prime number p, the 
fact that K is infinite here is actually indispensable for the injectivity of (15.2). 
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Solution Let the sequence (uy)n>0 be defined by up = O and uy4) = ur + 1, for 
n> 1. If g(X) = X, then f(up) = 0 = g(uo) and, by assuming f (uz) = g(ug), 
we get 


frp) = fug+) = fury? +1 
= (ux)? +1 Sug +1 = gucyi). 
Therefore, we conclude by induction that f(u,) = g(un) for every nonnegative 
integer n. However, since the values of the terms u, are pairwise distinct, we 


conclude that f(x) and g(x) coincide for an infinite number of distinct values of 
x. It thus follows from Corollary 15.10 that f = g,i.e., f(X) = X. Oo 


Example 15.13 (Hong Kong) Let g(X) = X° + X*+4+ X?+ X*+ X + 1. Compute 
the remainder of the division of g(X 12) by g(X). 


Solution The division algorithm assures the existence of g, r € R[X] such that 


g(X!?) = g(X)q(X) +r(X), 


with r = Oor0 < dr < 4. Letting w = cis and taking x = o* in the 


corresponding polynomial functions, with | < k < 5, we obtain 
g(o*) = go')q(o*) + r(o*) (15.3) 
forl1<k <5. 


Now, since w!2* = cis (4k) = 1, we get g(a!) = g(1) = 6. On the other 
hand, for 1 < k < 5, Lemma 13.18 furnishes 


8) a0" +0" +0" +o +0415 


Thus, (15.3) reduces to r(w*) = 6 for 1 < k <5, so that the polynomial r — 6 has 
at least five distinct roots. However, since r — 6 = 0 or 0(r — 6) < 4, Corollary 15.8 
givesr —6=0. Oo 


The coming further consequence of Lagrange’s theorem (Corollary 15.8) guar- 
antees that the image of the polynomial function associated to a nonconstant 
polynomial on K is an infinite set. 


Corollary 15.14 /f f € K[X]\K, then the image Im (f) of the polynomial function 
associated to f is an infinite subset of K. 


Proof Assume the contrary, i.e., that Im(f) = {a1,...,a%} C K. Then, for every 
x € Kwehave f(x) € {a1,..., a}. However, since K is an infinite set, there exists 
1 <i <k and pairwise distinct elements x1, x2, ... € IK such that 


f(x) = f(x2) =... = ai. 
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Hence, f(X) — qa; is a nonvanishing polynomial (for f is nonconstant) with an 
infinite number of distinct roots, which is a contradiction. oO 


Example 15.15 (Canada) Find all nonconstant polynomials f € Q[X] such that 
Ff(f(X) = bie. Sug for some given natural number k. 


Solution It is easy to see (cf. Problem 1) that for every x € Q we have f(f(x)) = 
f(x)*. Yet in another way, we have f(y) = y* for every y € Im(f). Now, the 
previous result assures that Im (f) is an infinite subset of Q, so that Corollary 15.10 
gives f(X) = X*. Oo 


Problems: Sect. 15.1 


1. * Show that the usual definition of composite function, when applied to 
polynomials f,g € K[X] \ {0} of degrees m e n, respectively, produces a 
polynomial in K[X]\ {0} of degree mn. Letting f og denote such a polynomial, 
show that its associated polynomial function coincides with the composite 
function f o g, where 7 and g denote the polynomial functions associated to f 
and g, respectively. 

2. * Leta, b andr be rational numbers, with r > 0 being such that ./r is irrational. 
Do the following items: 


(a) Fora fixed k € N, show that there exist az, by € Q such that (a+b/r)* = 
ag + by/r. 
(b) If f € Q[X] \ {0}, prove that f(a +b /r) =0 6 f(a— br) = 0. 


3. One of the roots of the polynomial X* + aX? + X? 4+ bX —2is 1 — V2. Find 
the remaining roots, knowing that a and b are both rational numbers. 

4. Leta, b € K be given, witha 4 0. For f € K[X]\ {0}, prove that the remainder 
of the division of f by aX + b is equal to f (—2). 

5. Does there exist a polynomial f € R[X] such that f(sinx) = cos x for every 
x? Justify your answer. 

6. Let w # km be acomplex number. Prove that, in R[X], the polynomial X* + 1 
divides the polynomial 


f (X) = (cosa + X sina)” — cos(na) — sin(na)X. 


7. Find all n € N for which X"*! — X” + 1 is divisible by X? — X + 1. 
8. (BMO) For m € Z, prove that the polynomial 


X4 — 1994x3 + (1993 + m)X* —11X +m 
does not possess two distinct integer roots. 


9. (AIME) Find all real numbers a and b such that X* — X — 1 divides aX!” + 
bx16 +1, 
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10. (Moldavia) Let n > 4 be a given natural number and p be a monic polynomial 
of degree n, having n distinct integer roots, one of which is 0. Compute the 
number of distinct integer roots of the polynomial p(p(X)). 

11. (Austrian-Polish) Let f (x) = ax? + bx? +cx +d be a polynomial with integer 
coefficients and degree 3. Prove that it is not possible to find four distinct primes 
P1, P2, p3 and pg such that 


If(pi)| = |f(p2)| = | fad] = | f(p4)| = 3. 


12. (Canada) Let p(X) = X” + an_1X"~! +---+.a,X + ag be a polynomial with 
integer coefficients, and assume that there exist distinct integers a, b, c and d 
for which p(a) = p(b) = p(c) = p(d) = 5S. Prove that there exists no integer 
m such that p(m) = 8. 

13. (IMO shortlist) Find all values of k € N such that X* + X + 1 divides X7* + 
1+(X +1). 

For the coming example, recall (cf. paragraph that precedes Problem 13, 
page 30) that a multiset is a collection {{a),a2,...,an}} of not necessarily 
distinct elements, with {{a), a2,...,@n}} = {{b1, b2,..., bm}} if and only if 
m =n and each element appears the same number of times in each multiset. 

14. Let {{a,, a2,..., a,}} and {{b1, bo, ..., bn }} be two distinct multisets, each of 
which formed by positive integers. If 


{{aj taj; lsi<j<nj}={{b + bj; lsi<j <n}, 


prove that n is a power of 2. 
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This short section studies the problem of finding rational roots of polynomials 
with integer coefficients, presenting several interesting applications. Item (a) of 
the coming proposition brings the central result, which is known in mathematical 
literature as the rational roots test for polynomials with integer coefficients. 


Proposition 15.16 Let n > 1 be an integer, f(X) = a,X" +--+ +a,X + ag be 
a nonzero polynomial with integer coefficients and p and q be nonzero, relatively 
prime integers. If f (2) = 0, then: 


(a) P | ao and q | ay. 

(b) If f is monic, then the possible rational roots of f are integers. 

(c) (p —mq) | f(m) for every m € Z. In particular, (p — q) | f 1) and (p + q) | 
f(-l). 
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Proof 


(a) Starting from f ( = 0, we readily obtain 


Gn p" + dn—1p"'q +-+-+.a1pq" | +aoq" = 0 


and, hence, 
1 -1 
ee mee 


agg” = P(—4n p"~ 
nm) agq”') . 


Gn p" = q(—an—-1pP 


Therefore, p | agg” and q | dn p”. However, since p and gq are relatively prime, 
item (a) of Proposition 6.22 assures that p | ao and qg | dy, as we wished to 
show. 

(b) This follows immediately from (a). 

(c) Since i) = 0, we have f(m) = f(m) — f@) or, which is the same, 


1 
f(m) = (anm” +--+ +aym + ao) — gn ane tek ay pg” * + agg"), 


Hence, 


q" fm) = q"(aqm" + +++ +aym +49) — (anp" +++ + a1pq" | +404") 
= an ((mq)" — p") +++ +aig""(mq — p) 
= (mq — p)r 
for some r € Z, where we used item (a) of Example 6.3 in the last equality. The 
above computations show that (mq — p) | q” f(m). Thus, in order to conclude 
that (mq — p) | f(m), it suffices to show (again by item (a) of Proposition 6.22) 


that gcd(mq — p, q”) = 1. To this end, by successively applying item (b) of that 
result, together with Corollary 6.23, we get 


gcd(p, gq) = 1 => ged(mq — p,q) = 1 > ged(mg — p,q") = 1. 


The rest follows at once. oO 


The previous result can sometimes be applied to establish the irrationality of 
certain real numbers. We shall now see a couple of examples in this direction. 
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Example 15.17 Prove that \/ 2 + 2 + V2 is irrational. 


Proof Lettinga = V2+V2+ af 2 we have a2 —2= /24+ /2 and, then, (a2 — 
2)? = 2+ /2. Therefore, ((a2 — 2)? — 2)? = 2, so that w is a root of the monic 
polynomial with integer coefficients 


F(X) = (X? — 2)? —2)* -2 
= (Xt — 4x? 4.2)? —2, 


Thus, if a € Q, items (a) and (b) or the former proposition assure that a € N and 
a | f(O) = 2, so that aw = | ou 2. However, since 


Leesy244242 <1/24V240=2 


we have reached a contradiction. oO 


Example 15.18 Prove that tan 10° is irrational. 


Proof We let a = tan 10°. Applying standard trigonometric formulas (cf. Chapter 7 
of [9], for instance), we successively obtain 


2 tan 10° 
tan20°+tan10° {omnis + tan 10° 
tan 30° = = 
1 — tan 20° - tan 10° | — 2tan2 1oe 
2179° 
1—tan? 10 


2a 
Loe + _ 3a — a3 


feet. Lau? 


l-—a@ 
‘ o ae 
However, since tan 30° = Fa we get 
Ba-a3)? 1 
(1 —3a2)? 3 


or, which is the same, 


30° — 2704 + 33a? —1=0. 
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Hence, tan 10° is a root of the polynomial with integer coefficients f(X) = 
3X° —27X4 + 33X? — 1, and it suffices to show that it does not possess any rational 
roots. For what is left to do, first note that, by Proposition 15.16, the possible rational 
roots of f are +1 or i nevertheless, direct computations show that f(+1) and 
f (£4) are all nonzero, as wished. oO 


Example 15.19 (Yugoslavia) Find all positive rationals a < b < c for which all of 
the numbers 


1 1 1 
a+b+c, ~-+—+-, abc 
a bee 


are integers. 
Solution Letting f(X) = (X — a)(X — b)(X —c), we conclude that a, b and c are 
the roots of f£; moreover, they are all rational. On the other hand, 


f(X) = X?-— (at+b+c)X? + (ab + be + ca)X — abc, 


with 


1 1 1 
ab-+be-+ ea =abe(< +5 +2) EN. 
a bee 


It thus follows that f € Z[X]. However, since f is monic, the rational roots test 
applied to f assures that all of a, b, c must be integers. 

Finally, since a, b and c are positive, we conclude that it suffices to find all 
natural numbers a < b < c for which i + : + : is also natural. In order to solve 
this problem, start by noticing that 


1 1 1 
ee ee 
a b ¢ 


so that a < 3. Then, by separately considering the cases a = 1, 2 or 3 and iterating 
the above reasoning, one easily finds 


(a,b,c) = (1,1, 1), C, 2, 2), (2,4,4), (2, 3, 6) or (3, 3, 3). 


(For instance, if a = 3, then b, c > 3, and hence 


ae ee ee | 
ug = 3 3 a 


so thata = b=c=3.) oO 
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We finish this section by presenting an interesting application of item (b) of 
Proposition 15.16. More precisely we shall characterize all rational numbers a for 
which cos(a@z) is also rational. To this end, we first need to introduce a special class 
of polynomials. 

Let (fn)n>1 be the sequence of polynomials with real coefficients given by 
fi(X) = X, fo(X) = X? — 2 and, for an integer k > 1, 


Se+2(X) = X fer (X) — fi (X). (15.4) 


The polynomial f, is called the n-th Bernstein polynomial, after the Russian 
mathematician of the twentieth century Sergei N. Bernstein. The next proposition 
unfolds some of its properties. 


Proposition 15.20 Jf (fi)n>1 is the sequence of Bernstein’s polynomials, then: 


(a) fn(2cos0) = 2cos(n@), for every 6 € R. 
(b) fn is monic, has integer coefficients and degree n. 


Proof For item (a), it is obvious that f;(2 cos @) = 2. cos @, whereas the formula for 
cos(2@) in terms of cos @ gives 


fo(2cos 6) = (2cos@)* — 2 = 2(2. cos” 6 — 1) = 2c0s(26). 


Now, let k > 1 be an integer and assume, by induction hypothesis, that 


fj(2cos@) = 2cos(j@) for 1 < 7 < k +1 and every 6 € R. The product formula 


cosa + cos B = 2cos (248) cos (254) gives 


2cos@ -2cos(k + 1)8 — 2cos(k@) = 2 cos(k + 2)0. 
Therefore, it follows from (15.4) and the induction hypothesis that 


Sk42(2cos@) = 2cosé- fe4i1(2cos 0) — fx(2 cos 0) 
= 2cos@ -2cos(k + 1)@ — 2cos(k@) 
= 2cos(k + 2)0 
for every 0 €R. 
Item (b) is trivially true for f; and f2. Arguing once more by induction, if f; 


and fx41 are monic, with integer coefficients and degrees k and k + 1, respectively, 
then the recurrence relation (15.4) readily assures that the same properties hold for 


Si-+2- Oo 


We are finally in position to present the promised result. 
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Theorem 15.21 [fa € Q is such that cos(az) € Q, then cos(am) = +1, 5 or 0. 


_—P —_m; 
Proof Let m, nN, Pid € Z such that n,q # Oand cos(7) = q: Fora = 7, item 
(a) of the previous proposition gives 


tr(~*) = fy(2cos(wx)) = 2.cos(nax) = 2cos(mm) = 2(=1)". 
q 


Now, item (b) of that result shows that 2P is a root of the monic polynomial with 
integer coefficients f,,(X) — 2(—1)”. Hence, item (b) of Proposition 15.16 gives 
ze € Z. On the other hand, since 


2 
A (=z) é [—2,2], 
qd n 


we get 7 = +2, +1 or 0. Therefore, : =+1, 5 or 0. oO 


Problems: Sect. 15.2 


1. (Canada) Let f be a nonzero polynomial with integer coefficients. If f(0) and 
Ff (C1) are odd, prove that f does not have any integer roots. 

2. (OCM) Prove that there does not exist nonzero integers x, y and z in arithmetic 
progression, such that x> + y? = 2°. 

3. Show that if (fn)n>1 is a sequence of polynomials of real coefficients such 
that f,(2cos@) = 2cos(n@) for every 0 € R, then f,, is the n-th Bernstein 
polynomial. 

4. Forn €N, let f, be the n-th Bernstein polynomial. Show that 


nyt eX i 
Z+ i =n Z+ ; 
x z 


for every z € C \ {0}. 
5. Find the roots of the polynomial 2(2X? — 1)* — X — 1 by calling one of them x 
and: 


(a) Letting y = 2x? — 1. 
(b) Imposing that x = cos 6. 


Then, use the previous items to compute cos 5. 
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6. Leta, b € Z be such that Va + Jb + Va—VJb €Q. 


(a) Prove that it must be an integer and a divisor of 8a>. 
(b) Assuming that a? — b is a perfect cube, do the following items: 


i. Prove that the given number is a divisor of 2a. 
ii. Characterize all such pairs of integers a and b. 


7. (Ireland) Prove that (2+ i)” 4 (2—1)” for everyn € N. 
The next problem partially generalizes Example 6.24. 
8. (OCM) Do the following items: 


(a) If none of the natural numbers aj, a2, ..., a, is a perfect square, show that 
Jay + faz +--+ + ./dy is a root of a monic polynomial f, € Z[X], of 
degree 2”. 


(b) Show that /2+ /3+ /5+ J/7+4+ vV111 is irrational. 


9. (IMO—shortlist) Given a, b € Q, can the polynomials X 5 _ ¥ — 1 and X24 
aX + b have a common complex root? 
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As a particular case of Corollary 15.9, every polynomial of complex coefficients and 
degree n has at most n complex roots. On the other hand, the polynomial f(X) = 
X* —2 € Q[X|] has rational coefficients but does not admit any rational roots; 
accordingly, g(X) = X* +1 © R[X] has real coefficients but no real roots. In 
both cases, the fact that such polynomials do not admit roots in Q or R is due to 
deficiencies of such number sets, in the sense that, in them, one cannot perform 
certain root extractions. 

On the other hand, Bhaskara’s formula shows (cf. Problem 1) that every second 
degree polynomial with complex coefficients has two (possibly equal complex 
roots), and the coming example looks at the third degree case. The reasoning 
presented below is essentially the one developed by the Italian mathematicians of 
the sixteenth century Scipioni del Ferro, Gerolamo Cardano and Niccolo Fontana, 
known as Tartaglia (in this respect, see also Problem 2). 


Example 15.22 A third degree polynomial f(X) = a3X? +a)X* +.a,X + ao with 
complex coefficients has a complex root. 


Proof We can obviously assume that a3 = 1. Also, for a givena € C, we have that 
z € Cisaroot of f(X) if and only if z—@ isa root of f(X + q@); since 


f(X +a) = (X +a)? + ay(X +0)? +a (X +a) +49 


= X34 Ba + a2)X7 + Ba? + 2ara +a1)X + f(a), 
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by choosing a = — - we can assume further that az = 0. 
We are thus left to showing that a third degree polynomial f(X) = X?+aX +b, 
with a, b € C, has complex roots. To this end, we substitute z = u + v to get 


f() =(utvy+au+v)+b 
=ur+vu?+3uv(u+v)+auut+v)+b 


=(w+u°+b)+3(u+v\(uv+a). 


Since we are trying to find some complex root for f, we try to impose that uv? +v? = 


—b and uv = —a, thus getting the system of equations 
w+v=-—b 
uv = —a : 


In turn, such a system can easily be solved, for u? and v° are the roots of the second 
degree equation X7 + bX —a*? =0 oO 


At this point, we could turn to fourth degree polynomials with complex coeffi- 
cients and, by performing even trickier computations, show that they have complex 
roots too. Instead, we turn to the general case, which was settled by Gauss in his 
doctor thesis, in 1799 (nevertheless, see Problem 3 and the subsequent paragraph). 
This is the content of the coming result, which is know in mathematical literature as 
the Fundamental Theorem of Algebra (we abbreviate FTA). The proof we present 
is due to the French mathematician of the eighteenth century J-B. le R. d’ Alembert. 


Theorem 15.23 (Gauss) Every polynomial f € C[X]\C has at least one complex 
root. 


Proof By the sake of notation, write f(z) = a,z”"” +--- +a ,z + do to denote the 
polynomial function associated to f. Without any loss of generality, we can assume 
that ag # 0. 

For z # 0, the triangle inequality for complex numbers (cf. Problem 3, page 326) 
gives 


An—1 an-2 ao 
lf (2)| = Izl" Jan + + 5 fof 
Zz 4 
l@n—1|— |@n—2 |ao| 
> |z|" G 7 sats |. 
[z| [z| [z| 


Therefore, letting 


2nlan—1| /2n|an—a1 eee (15.5) 


lan| : J/|an| — /|an| 


|z| > max | 
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we have eae! < Ma , so that 


lan| lan| 
n - = a 
FCO > lt” (lanl =n SA) = et 


nj 2n\ao| 


However, since |z| > iar 
n 


we have |dy||z|” > 2n|ao| and, hence, | f(z)| > 
n|ao| = |aol. 
In short, letting R denote the right hand side of (15.5), we get 


Iz] > R => |f@)I > laol = |F ODI. 


Now, in Sect. 21.2 we will show (cf. Corollary 21.21) that there exists zo € C such 
that |zo| < R and 


| f(Zo)| = min{| f(z)|; z € C, |z| < R}. 


Hence, what we did above assures that 


lf (Zo)| = min{| f(z)|; z € C}. 


By contradiction, let us suppose that f (zo) 4 0, and show that there exists h € C 
for which | f(zo + h)| < |f(Zo)|. To this end, we start by noticing that there exist 
complex numbers co, cj, ..., Cn, independent of and such that 

f(Zo +h) =a +41 (Zo +4) +--+ + 4n(Zo +h)" 
=cotecih+---+ enh"; 


moreover, co = f (Zo) # 0 and c, = a, 4 0. Take the least 1 < k < n such that 
cx # 0. Then, letting dj = a for 0 < j <n, we obtain 


zoth 
ae = |1 + dyh* + dyy ih! + +--+ dyh”| 
2 ii + dh" |+ldegh +e $a," | 
d d, 
= [1+ deh*| + fdghk| [Attn e-em 
dk dk 


Now, estimates analogous to those we did with the aid of (15.5) allow us to choose 
r > Osuch that [hl <r => [4th +--+ denr-*] < 5. Then, 


lf(zo +h)| i] k 
|h| <r => ————  < [1 +a h"| 4+ =|dgh"|. 
| f (zo) 2 
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Let d, = scisa and h = r cis @. The first de Moivre formula gives us 


If Zo + /)| 


1 
~~ < [1 + sr* cis (a + kO)| + =sr*; 
| f (Zo)| 2 


aa 


therefore, choosing 6 € R in such a way thata + kO = mr (ie., taking 0 = 77°), 
we get 


h 1 
We Goste) <|1+sr*cisa| + —sr* 
If Zo)l 2 
1 
olf tgp eh mane 
| sr’ | rad 
1 
=—1—-<srk < 1, 
2 
whenever sr* < l,ie.,0 <r < ae . With such an r (and with the corresponding 
h), we have | f (zo + A)| < |f (Zo)I. a 


An immediate consequence of the FTA is collected in the coming 


Corollary 15.24 If f(X) = a,X”" +---+a,X + ag is a polynomial with complex 
coefficients and degree n > 1, then there exist complex numbers Z\,..., Zn such 
that 


f(X) = an(X — 21)... (X — Zn). (15.6) 


The right hand side above is the factorised form of f. 


Proof We prove the corollary by induction on n, the case n = 1 being immediate. 
Take an integer n > 1, and assume that the corollary is valid for every polynomial 
with complex coefficients and degree n — 1. 

If z} € Cis a root of f (whose existence is guaranteed by the FTA), the root 
test assures the existence of a polynomial g with complex coefficients, such that 
F(X) = (X¥ — z1)g(X). Note that g has degree n — 1 and leading coefficient a,; 
hence, by induction hypothesis, there exist z2,..., Z, € C such that g(X) = a,(X— 
Z2)...(X — z,). Therefore, 


F(X) = (XK — 21) 8(X) = an(X — 21)(X — 22)... (KX — 2n) 


and there is nothing left to do. Oo 
A useful variation of the previous corollary is the one given by the next result. 


Corollary 15.25 If f(X) = a,X" +---+a,X + ag is a polynomial with complex 
coefficients and degree n > 1, then, givena € C, there exist z1,..., Zn € C such 
that f (ze) =a forl<k<n. 
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Proof Apply the former corollary to g(X) = f(X) —a. Oo 


Back to the general case, let f(X) = a,X" +---+ a,X + ao be a nonzero 
polynomial with coefficients in K. If there exist elements a1, ...,@,) € K for which 


F(X) = ay (X — 1)... (X — ay), 


we shall also say that the expression at the right hand side is the factorised form of 
f over K. 

Since some of the a;’s may appear several times in the factorised form, if we 
look only at the distinct ones we conclude, possibly after renumbering them, that 
there exist | < m <n and positive integers k,, ..., ky, such that 


F(X) = ag (X — a). (X — an), 


with kj +--- +k,» =n andaqj,...,Q@m pairwise distinct elements of K. 
The two coming examples give interesting applications of the factorised form of 
polynomials. 


Example 15.26 Given a natural number n > 1, do the following items: 


(a) Obtain the factorised form of f(X) = X"-! 4+ X"-74.--4 X41. 
(b) Let A} Az... A, be a regular polygon with n sides, inscribed in a circle with 
radius 1. Compute the value of A; A2- A,A3-...- AjAn. 


Solution 


(a) Since 
C=) Ke DO oN ae SS Sh 


the complex roots of f are precisely the n-th roots of unity distinct from 1. 
By (13.19), such roots are the complex numbers a, w,...,@"—!, witha = 
cis 2m Therefore, the factorised form of f is 


f (X) = (X — @)(X — @’)...(X - a"). (15.7) 


(b) Assume, without any loss of generality, that the circle of radius 1 circumscribing 
A ,Az2... Ay, is the unit circle centered at the origin of the complex plane, as well 
as that Aj = 1 and Az = , withw = cis an Then, Aj = w/—! for 1 < Jn, 
so that 


AjA2- AjA3-...- AjAn =|1—a||1—@7|...|1-—o" || 
= |(l1—w)(1—@*)...(l1—@"!)| 
= |f(@| =n. 
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The next example is concerned with Bernstein polynomials, so that the reader 
may find it convenient to recall their definition, in the paragraph that precedes 
Proposition 15.20. 


Example 15.27 Forn €N, let f, be the n-th Bernstein polynomial. Show that 


(2k — on 


fr(X) = rl (x 2608 —— 


k=1 


Proof Item (a) of Proposition 15.20 gives f, (2. cos 0) = Oif cos(n@) = 0. However, 
XT 
cos(n8) = 0 & n0 = a + kn, dkeZ 


ka 
S&é=—4+—, dake Z. 
2n n 


mx 3m (2n—1)x 

2n? Ine"? 2n ? 
all these values of @ give distinct values for cos 0. Therefore, 2 cos (=), 2.cos ( at), 
..., 2COS (Ca br) are n distinct roots for f,. Since f, it is monic and has degree 


n, the rest follows from Corollary 15.24. oO 


Now, fork = 0, 1,...,n—1, we get @ respectively equal to and 


Problems: Sect. 15.3 


1. Prove that the formula giving the roots of a second degree equation still holds 
to compute the complex roots of aX? + bX +c, witha, b,c € C anda ¥ 0. 

2. Leta, b € C, with a ¥ 0. In the notations of the discussion of Example 15.22, 
if ug € C is such that uR is a complex root of X? + bX — a°, show that the 
roots of X? + aX + b are given by uo + ug, Un@ + vow” and ugw* + v9, 
with v9 = =— and w = cis aE Moreover, show that such are distinct roots if 
4a? + b? £0. 

3. The purpose of this problem is to delineate the steps followed by the Italian 
mathematician of the sixteenth century Lodovico Ferrari to find the roots of a 
fourth degree polynomial X* + aX? + bX? +cX +d, witha, b,c,d €C: 


(a) If z € C is any complex number, check that 


4 3 2 2, &\? a’\ 5 
Z +az+bz tezt+d=(z +=) + (o- +): +cz+d. 
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(b) If z € C is a root of the given polynomial and w is any complex number, 
check that 


2, 4% 4 ay » 2 
(z +F+w) = (2w—b+ <)e + (wa—c)z+u* —d. 
(c) Show that if w is a root of the third degree equation 
2 
2 2 7 
(wa — 0)? =4(w = d)(2w ape =): 


then the equality of item (b) reduces to 


| 3 
+S twat 2w— b+ Fz+Vw?—d ; 


where ,/2w — b+ a and / w? — d stand for any complex square roots of 


2w —b+ © and w? —d. 
(d) Show, without resorting to the Fundamental Theorem of Algebra, that the 
given fourth degree polynomial has a complex root. 


The reader who went through Example 15.22 may have noticed that we 
reduced the task of finding a complex root for a third degree polynomial to 
that of finding a complex root for an associated second degree polynomial; 
subsequently, in the problem above we reduced the task of finding a complex 
root for a fourth degree polynomial to that of finding a complex root for an 
associated third degree polynomial. One is thus naturally tempted to guess 
whether it would be possible to extend such a recursive reasoning, thus reducing 
the task of finding a complex root for a general polynomial of degree n > 1 
to that of finding a complex root for an appropriately associated polynomial 
of degree n — 1. Attempts to implement such a strategy were unsuccessfully 
pursued by several mathematicians along the sixteenth, seventeenth and eigh- 
teenth centuries, until a general consensus started to form around the idea that 
this would perhaps not be possible. Actually, in the dawn of the nineteenth 
century, the Norwegian mathematician Niels H. Abel proved that there is no 
general formula for finding the roots of a fifth degree polynomial in terms of 
their coefficients; a few years later, the French mathematician Evariste Galois 
extended such a result for all polynomial of degree n > 5, obtaining the 
first major achievement of what is known today as Galois Theory. A gentle 
introduction to the involved ideas, with complete proofs, is the object of the 
marvelous book [20]. 
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* If f € R[X] \ {0} and z € C\R, prove that f(z) = 0 S f(z) = O. Then, 
conclude that every nonconstant polynomial with real coefficients has an even 
number of nonreal complex roots, counted with multiplicities. 


. * Prove that every polynomial with real coefficients and odd degree has an odd 


number of real roots. In particular, such a polynomial always has at least one 
real root. 


. If f € R[X] \ R is monic and has no real roots, show that there exist g,h € 


R[X] such that f = g? + h?. 


. *Let f € QLX]\ Q bea polynomial of degree n € N, anda ¥ 0 be a complex 


root of f. Given m € Z, prove that there exist bo, bj, ..., bn—1 € Q such that 


ql” =bo+biat::-+b,-10"!. (15.8) 


. Let f(X) = ay, X" +ay_1X"!4---+a,X +a be a polynomial with complex 


coefficients and degree n > 1. If z € C is one of its roots f, prove that 


A 
lz| < max {1 ” c 
lan| 


where A = max{|do], |ai|,--., |@n—1|}- 


. * Let f(X) = a,X” + ay X"-1+---+a,X +a be a polynomial with integer 


coefficients, such that a, > 1, and k > 2 be an integer for which |a;| < k for 
0 <i <n. If zis acomplex root of f, prove that Re(z) < 1+ s 

A polynomial f over C of the form f(X) = aX* + bX? + cX* + bX +4, 
with a ¥ 0, is said to be a reciprocal of degree 4 grau. Compute its roots and, 
then, formulate and solve the analogous problem for reciprocal polynomials of 
degree 6. 

(Singapore)’ Let f be a polynomial with real coefficients and degree n, such 
that f(k) = = for every integer 0 < k < n. Compute the possible values of 
f+. 


Given an integer n > 2, prove that 


_ nm , 2n , 3x ._ (n-l)x n 
sin — sin — sin —...sin = i 
n n n n 2 
For m € N, show that: 
Ga TT at DT - (m—l)n _ f/m 
(a) sin5,, Sins, ..-SiN 3 = het- 
: 4 : 20 : mr _ S2m+l 
(b) sin Pari SIN gay SIN = 


(Romania) Find all nonconstant polynomials p € R[X] such that p(X?) = 
p(X) p(X — 1). 


3For other approaches to this problem, see Problem 4, page 444, and Problem 3, page 450. 
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In this section, given f € C[X] \ C and z € C, we derive a useful necessary 
condition to be satisfied for z to be a multiple root of f. To this end, it is convenient 
to make the convention of saying that z € C aroot of f of multiplicity zero if z is 
actually not a root of f. We shall also need the coming 


Definition 15.28 Given a polynomial f(X) = a,X" +---+a,X +ao € C[X], we 
define its derivative* f’ €¢ C[X] of f as the polynomial 


f(D) Sneak” ! 4 G@ = Dagea kk”? 4 4X + 


if af > 0. Otherwise, we let f’ = 0. 


The rule for getting the derivative of a polynomial is rather simple: the derivative 
of a constant polynomial is the identically zero one, whereas the derivative of a 
polynomial of degree n > 1 is obtained by erasing its constant term and performing, 
for 1 < k <n, the monomial exchange 


a, X* be ka, X*-), 


The coming proposition establishes the main properties of derivatives of polyno- 
mials. 


Proposition 15.29 For fi,..., fx € C[X] and a,,..., ax € C, we have: 
/ - 

(a) (oh ai fi) = pee a f;. 

O(a) = eater aie 

Proof 


(a) This item follows immediately by induction on k > 1. Note that the initial cases 
arek = | andk = 2. 
(b) Firstly, let 


f(X) =a,X" +---+a,X +49 and 2(X) = bm X” +---+b,X + do. 


4The reader acquainted with Calculus has certainly noticed that the definition of f’ matches the 
one presented in Calculus courses by computing limits of Newton’s quotients. The point of the 
present definition and the subsequent proposition is that they will equally apply to polynomials 
over Zp, in Chap. 19. 
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By omitting X whenever convenient, it follows from (a) that 


(9 =[(Doaix!)e] = (Xa jxig)' = dw xigy. 


On the other hand, 


m re m 5 ah m = 
(ajX/g)' — (ax! > 4x’) — (do ajbix/*') 4 YS abixity 


i=0 i=0 i=0 


m 
= ot dajb)xi! 


i=0 


m m 
—— Do yap ee + Sige xs 
i=0 i=0 


m m 
= jajX!~' Sb) X! +ajX! \ ibj x! 
i=0 i=l 


= (ajX/)'g + (aj X/)g". 


Hence, going back to the previous expression for (fg)’, we obtain 


(fg) =o ((aj)X/)'g +4; X/8’) 


j=0 


=(@ wer (Loa x/),! 


j=0 


= (Siax!)¢ tee 


j=0 
= f'gt fe’, 


where we used item (a) once more in the next to last equality. 
Finally, the extension to k polynomials fi,..., f; € C[X] easily follows by 
induction. 
oO 


Corollary 15.30 Given g € C[X] andn €N, if f(X) = g(X)" then f'(X) = 
ng(X)"—! 9/(X). In particular, if f (X) = (X — a)", then f'(X) =n(X —a)"~! 
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Proof Letting k =n and f| =--- = fp = g initem (b) of the previous proposition, 
we obtain 


f'(X) = a(x)" 'g!(X) =ng(X)""!'(X). 


i=1 


The particular case follows from this formula, letting g(X) = X — a. Oo 


The coming result relates the multiplicity of a root of a polynomial to its 
derivative. 


Proposition 15.31 Let f € C[X] \ {0} andz eC. 


(a) If zis a root of multiplicity m > | of f, then z is a root of multiplicity m — 1 of 
/ 


(b) If z is a root of f and is a root of multiplicity m — 1 of f', then z is a root of 
multiplicity m of f. 


Proof 


(a) The root test allows us to write f(X) = (X — z)g(X), with g(z) 4 0. On the 
other hand, item (b) of the previous proposition, together with its corollary, give 


f'(X) = m(X = 2)" 9(X) + (X — 2)"8'(X) 
= (X — z)™"![mg(X) + (X — zg" (X)I. 


Hence, letting h(X) = mg(X) + (X — z)g’(X), we have h(z) = mg(z) 4 0 and 
f'(X) = (X — z)"'h(X). By the very definition of multiplicity, we conclude 
that z is a root of f’ of multiplicity m — 1. 

(b) Again by the root test, we can write f(X) = (X — zk g(X), for some integer 
k > 1 and g € C[X] such that g(z) # 0. Then, item (a) assures that the 
multiplicity of a as a root of f’ is k — 1, so that k — 1 = m — 1 and, hence, 
k=m. oO 


Corollary 15.32 [fz <¢ Cand f € C[X] \ {0}, then z is a multiple root of f if and 
only if f(z) = f(z) = 0. 


Proof If zis a multiple root of f, this means that it is a root of f with multiplicity 
is at least 2. Therefore, item (a) of the previous proposition implies that z is a root 
of f’. 

Conversely, if z is a root of both f and f’, then item (b) of the previous 
proposition shows that the multiplicity of z as a root of f is at least 2. In turn, 
this is the same as saying that z is a multiple root of f. Oo 
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Example 15.33 (Sweden) Prove that, for every n € N, the polynomial 


1 2 1 n 
14+ X+—X?4..-+—X 
2! n! 


has no multiple roots. 


Proof Let f(X) =14+X+ aX? feet aX", and assume that it has a multiple 
root z. Then, by the former corollary we have f(z) = f’(z) = 0. Now, since 
f®= f(%Y+ aX", it would follow that 


0=fW=f'@+5, 
nN: 


i.e., Zz = 0. However, since f(0) = 1 4 0, we have reached a contradiction. oO 


In order to refine the conclusion of the previous corollary, we first need to 
generalize Definition 15.28. We do this now. 


Definition 15.34 For f € C[X] \ {0}, we define the k-th derivative of f, denoted 
f®, by 


f= f ifk=0— 
(FY) stket 


It readily follows from this definition that f) = (f) = f’; thus, f? = 
(f™) = (f’), so that we denote f° = f”. Accordingly, whenever convenient 
we write f°) = f” ete. 

If of = n and 0 < k < n, an easy induction guarantees that of (k) <n-—k;in 
particular, af = 0 and, hence, f“t) = f@t?) =...=0. 


Corollary 15.35 [fz ¢ Cand f € C[X]\ {0}, then z is a root of multiplicity m > 1 
of f if and only if 
(Os=1=f" 7 @=0 ad fF") FU. 


Proof Suppose first that z is a root of multiplicity m of f. Several applications of 
item (a) of Proposition 15.31 give us, on the one hand, 


f@ == fO PHO 


and, on the other, that z is a root of multiplicity zero of f, ie., that f(z) 4 0. 
Conversely, assume that the stated condition holds and let k € N and g € C[X] 
be such that f(X) = (X — z)¥g(X), with g(z) # 0. A straightforward inductive 
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argument (using again item (a) of Proposition 15.31) assures that z is a root of 
multiplicity k — j of f, for 0 < j < k. In particular, 


f@=...=f*%~® =0 and f(%) £0. 
Now 
f@=...=f" W=s0Sk=m 
and 


f{™Q#0Sk<m. 
a] 


In the rest of this section, our purpose is to show that, if f € C[X] \ {0} has 
degree n and z ¢€ C, then f is completely determined by the values f(z), for 
0 < k <n. However, before we can do that, we need two further consequences of 
Proposition 15.31. 


Corollary 15.36 Let f € C[X] be such that f = 0 or af <n. Ifz € C satisfies 
f@=:-= FMR = 0, then f = 0. 


Proof If f # 0, then the previous corollary, together with the stated conditions, 
assure that the multiplicity of z as a root of f is at least n + 1. However, this would 
imply that (X —z)"*! would divide f, which in turn contradicts the fact that Af <n. 


a] 
Corollary 15.37 Let f, g € C[X] \ {0}, with of, dg <n. If there exists z € C such 
that 
IO=fe ais Se"), 
then f = g. 


Proof An easy induction on k > 1, with the help of Definition 15.34 and item (a) 
of Proposition 15.29, assures that 


(fF — 8) (X) = F(X) — gO, 
for every integer k > 0. In particular, the stated conditions give 
(F-9°@=f%WO-8@ =0, 
forO<k <n. 


Finally, since f — g = Oor 0(f — g) < max{df, dg} < n, the previous corollary, 
applied to f — g, assures that such a polynomial vanishes identically. oO 


392 15 Roots of Polynomials 


The coming result is known in mathematical literature as the Taylor formula® 
for polynomials, and is a direct consequence of the last corollary above. 


Theorem 15.38 If z € C and f € C[X] \ {0} is of degree n, then 


(Y= f@t+ (X —z)". (15.9) 


[X= 2) 


i a ) _£°@ 
n! 


Proof Let 


g(X) = f@)+ SEX - a) be Lt 2)", 


f'@) _L°@ 
nl 
Since f 4 0, it follows from Corollary 15.36 that at least one of the numbers f(z), 
}! Oyen ) (z) is nonzero; therefore, g 4 0. On the other hand, it is immediate 
to check that, for 0 < k < n, we have 


500 = Fag 


and, hence, 


f@=8@, f@O=8'@, - [OO = BO). 
However, since f and g both have degrees less than or equal to n, the previous 
corollary assures that f = g. Oo 


Example 15.39 Let f € R[X]\{0} anda € Rbe such that f(a) = Oand f(a) > 
0 for every k > 1. Prove that f has no roots along the interval (a, +00). 


Proof Letting n = df, Taylor’s formula gives 


F(X) = AK a) tet (X — a)". 


f'(a) f™@ 
n! 


Since f + 0, at least one of the derivatives f(a), say f(a), is positive. 
Obviously, | < j <n. Therefore, for a real number x > a, we have 


f@) =A @-a) tes + (x — a)" 


) 
> @ on (x —a)i > 0. 


ce a) f%@ 
n\ 


5Brook Taylor, English mathematician of the eighteenth century. 
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Problems: Sect. 15.4 


1. Find all integer values of a for which f(X) = X?—aX*+5X —2 has multiple 
roots. 

2. * Establish the following partial generalization of item (a) of Proposition 15.31: 
if f, g € C[X] \ {0} are such that g(X)? | f(X), then g(X) | f’(X). 

3. * For z1,...,2Zn € C, let f(X) = (X — 7)... (X — Zn). Prove that, for z € 
C\ {z1,---, Zn}, we have 


Pe 331 
f( gf tern 


j=1 


4. * Generalize the previous problem proving that, if fi,..., fg € C[X] \ {0}, 
f =fi... fx and z € C is not a root of f, then 


f@_ fl@.  A@ 
jo fo" Ae 


20 


5. *Letn > 1 be an integer and w = cis =. 


(a) Show that, for every integer 1 < j <n, one has 


(o/ —1)...(0/ — @"!)(o/ — wt!) .. (of — o"!) = nw Ys, 


(b) Compute, in terms of n, the value of P = [lezeejen (wo! — ow)’. 

6. Given u,v,w € C, with w ¥ 0, let (a,)n>1 be such that ay43 = uayy2 + 
vag4 1 + way for every k > 1. Also, let a, 6 and y be the complex roots of the 
polynomial X* — uX? — vX — w. 


(a) Ifa, B and y are pairwise distinct, show that a, = Aaw”~'!+BB"—-!+Cy""! 
for every n > 1, where A, B and C are the solutions of the linear system 


A+B+C=q 
aA+PB+yC=a2 
aA+ PPB+y°C =a43 


(b) Ifa = 6 ¥ y, show that ay = (A+ Bn)a"~!+ Cy"! forn > 1, where 
A, B and C are the solutions of the linear system 


A+B+C=aq, 
a(A+2B)+yC =a 
a?(A+3B)+y?C = 43 
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(c) Ifa = B = y, show that a, = (A+ Bn+Cn?’)a"! for n > 1, where A, 
B and C are the solutions of the linear system 


A+B+C=a, 
a(A+2B+4C) =a 
a?(A+3B+9C)=a3 


. * Solve the linear recurrence relation 


dn43 — 6dy42 + 12dp41 — 8dy = 0, 


knowing that d; = 1, dy = 4 and d3 = 14. 
We now revisit Problem 13, page 22, with the methods we developed in this 
section. 


. (USA) For each nonempty finite set S of real numbers, let o(S) and 2(S) 


respectively denote the sum and product of its elements. Prove that 


1 1 

3 A ree i ec ae 1, 
7(S) 2 is 

DASCIn 


. Prove the following theorem of Gauss: if f(X) = a,X” +---+a,X + ao is 


a polynomial of degree greater than 1 and with complex coefficients, then the 
roots of f’ are contained in the smallest convex polygon (possibly degenerated) 
of the complex plane having the roots of f as vertices. 

* Let f be a nonzero polynomial with integer coefficients and degree n, and 
a € Z, Prove that f@) €Zfor0 <j <n. 

Let f be a nonzero polynomial with integer coefficients and p be a prime that 
does not divide its leading coefficient. If m € N is such that 


f(m) =O(mod p) and f’(m) 4 0 (mod p), 
prove that, for every k € N, there exists mz € N for which 
f (my) = 0 (mod p*). 
(IMO—shortlist) The real numbers a, b and c are such that there exists exactly 


one square® whose vertices lie on the graph of the polynomial function f (x) = 
x3 + ax? + bx + c. Prove that the length of the sides of such a square is 72. 


One can show (cf. Problem 5, page 444) that, given a simple n-sided polygon, one can always 
find a polynomial function of degree at most n — 1 and whose graph contains the set of vertices of 
the polygon. 


Chapter 16 
Relations Between Roots and Coefficients — “zx 


This chapter is devoted to the proof of some important relations between the 
coefficients of a polynomial and their complex roots; such results are generically 
known as the relations between roots and coefficients of a polynomial. We also 
discuss an important theorem of Newton on symmetric polynomials, which will 
reveal itself to be of central importance for the material of Chap. 20. 

In all that follows, recall that IK stands for Q, R or C. 


16.1 Polynomials on Several Indeterminates 


For a given n € N, a polynomial f in n indeterminates over K is a sum of 
monomials of the form 


yh i 
Giy...in X4 elk x; ; 
where aj,..;, € Kandij,...,in vary in Z,, with aj,._;, = 0 for almost every (ie., 
for all but a finite number of) sequence (i), ..., in) of nonnegative integers. In this 


case, we write 


f = F(X, XQ, 0605 Xn) = >» Ce ee Xi 


The degree of a polynomial f as above is the greatest possible value of the sum 
ij +-+++ in, such that aj,_.;,, 4 0. 

We let K[X1, ..., X;] denote the set of polynomials in n indeterminates over K. 
On such a set we define, in an obvious way, operations 


+:K[X,..., Xn] x K[X1,..., Xn] ~ KLQ,..., Xn] 
© Springer International Publishing AG, part of Springer Nature 2018 395 
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and 


» KEX),..., Xn] x KLX1,..., Xn] ~ KLX,..., Xn], 


respectively denoted addition and multiplication, which extend the homonymous 
operations on K[X] and continue to be commutative, associative etc. For instance, 
if f(X1, X2) = X4 + X1X2 + X3 and g(X1, X2) = X} — V2X1 Xo, then 


f(X1, X2) + g(X1, Xo) = XP + 1 — V2)X1X2 + XP + XB 


and 
F(X, Xz) - g(X1, X2) = X} — V2X3 Xo + XFX2 — V2X2X3 
+ X}X3 — V2x1 xi. 
Given f € K[X, cei aen ls we can look at it as a polynomial in X;, with 


coefficients in K[X,,..., Xi,..., Xn], where we have put the hat A over Xj; to 
indicate that all of the indeterminates, except for X;, are present. For example, let 


f (X1, X2, X3) = XFX2XF — XF — 5X1 X2 + 10X1 X37 XF, 
be a polynomial in K[X 1, X2, X3]; writing 
f(X1, Xo, X3) = X}X2-XF + 10K XZ XZ — (XP 4+5X1 Xd), 


we consider f as a polynomial in X3, with coefficients in K[X,, X2]. 
If f, g € K[X,..., Xn] are given by 


PO Masses Ne) Se 2 DO Ha KO 


iy prety in=O 


and 


BOG Gee Ka OS Bid RO ou Res 


LT] ,.+-5 in=0 


we say that f and g are equal provided aj,_.;,, = bi,...;, for all possible choices of 
indices i1,...,i, € Zy. 

For fixed x1, x2...,X%n € K, we write f(x1, x2,..., Xn) to denote the element 
of K given by 


The coming proposition establishes an important relation between such elements of 
K and the notion of equality of polynomials in several indeterminates. 
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Proposition 16.1 Let f,g € K[X1,..., Xn]. If Ai,..., An C K are infinite sets, 
then 


f=8 > F(X, X2,...,Xn) = B(X1, X2,.--, Xn), 


for all x, € Aj, x2 € A2,...,% € An. 


Proof If f = g, then it is clear that f(x, x2,...,%) = g(%1, X2,.--, Xn) for all 
X1,X2,-.--,Xn € K and, in particular, for xj € A, x2 € Ao, ..., 2% € An. 
Conversely, suppose that this last condition is satisfied, and let us prove that 
f = g by using induction on the number n of indeterminates. Corollary 15.10 
already gives the validity of this claim for m = 1. Assume, by induction hypothesis, 
that the result holds true for polynomials over K in n — 1 indeterminates, and write 


f(X1, X2,..., Xn) = dno Fi Xa, «+1 Xn) XY 


oy (16.1) 
g(X1, X2,.--,Xa) = paw, gj (Xo, teey Kyat 


with fj, gj € K[X2,..., Xn] forallO<i<m,O0<j <p. 
For arbitrarily fixed x2 € A2,...,X,) € An, the hypothesis gives 


SF (x1, X2,---, Xn) = B(X1, X2,.--, Xn) 


for all xj € Aj, Le., 


m Pp 
> fj x2, ee | = D> gj (x2, Se ING 
j=0 


j=0 


for all x1 © Aj. Since A; is an infinite set, applying Corollary 15.10 to the 
polynomials in X 


m 
f(X1,%2, wes Xn) = cas, 
0 


and 


P 
g(X1, x2,...,Xn) = Yo gj(x2, eee PC 
j=0 


we conclude that m = p and 
fj (m2, -. +, Xn) = Bj (x2, -.-, Xn) (16.2) 


for 0 < j < m. However, since x2 € A2,...,X, € An were arbitrarily chosen, 
we conclude that (16.2) holds for all x2 € Ao, ..., x, € An. Hence, our induction 
hypothesis guarantees that f; = g; forO < j < m,sothat (16.1) gives f=g. O 
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Remark 16.2 From now on, whenever we deal with polynomials f over K in 
two indeterminates, we will generally write f(X, Y) instead of f(X1, X2). An 
analogous remark holds for polynomials f over K in three indeterminates, which 
will be generally denoted by f(X, Y, Z), instead of f(X 1, X2, X3). 


Example 16.3 Write (X + Y + Z)> — (X? + Y? + Z3) as a product of polynomials 
of degree 1. 


Solution Choose arbitrary y, z € R*, with y 4 z, and consider the polynomial in X 
g(X) = f(X y= (KX ty tz -X°- GO +2’). 
Since 
RIO ag Fey = (ay ty ee HO 


the root test assures that the polynomial (in X) f(X, y, z) is divisible by X —(—y) = 
X + y. Analogously, f is also divisible by X + z and, since dg = 2, item (c) of 
Proposition 15.3 guarantees that 


F(X, y, 2) = A(X + y)(X + 2), 
for some a € R to be found. By evaluating the equality above at 0, we obtain 
ayz = f0,y,2) =(¥+ 2 — (WP +2) = 3yey +2), 


so that a = 3(y + z). Therefore, 


F(X, y, 2) = 3(y + 2)(X + y)(X 4+ 2) 


and, then, f(x, y,z) = 3(y + z)(x + y)(x« + 2z) for all x € R and y, z € R*, with 
y # z. Proposition 16.1 now gives 


F(X, Y,Z) = 3(¥ + Z)(X + Y)(X4+ Z). 


Problems: Sect. 16.1 


1. *If f € K[X1, ..., Xn] does not vanish identically, prove that there exist infinite 
sets Aj,..., An C K such that f(x,..., xn) #0 for all x, € Aj,...,% € An. 
2. Do the following items: 


(a) Prove that the polynomial (X — Y)° + (Y — Z)° + (Z — X) is divisible by 
(X —Y)(¥ — Z)\(Z — X). 

(b) Write the polynomial (X — YP Hy Zy eZ 8) asa product of three 
polynomials of degree 1 and one polynomial of degree 2. 
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3. Write the polynomial (X + ¥ + Z)° — X° — Y° — Z> as a product of three 
polynomials of degree 1 and one polynomial of degree 2. 


16.2 Symmetric Polynomials 


For what follows, given n € N we recall that J, = {1,2,...,n}; also, a function 
o : I, — I, is said to be a permutation of J, if it is a bijection from J, into itself. 
The object of study along this section is isolated in the coming definition. 


Definition 16.4 A polynomial f € K[X,,..., X;] is symmetric if 
Sf (X11, X2,..., Xn) = f (Xo), Xo), +--+, Xon))s 


for every permutation o of I,. 


For a better understanding of the above definition, let us look at the polynomials 
S, g € KLX, Y] given by 


FOL VIS REY aA aR ond oR Yi ae Or Se, 
The first is symmetric, whereas the second one is not, for 
fU, X= V+ X* -YX + V4 ¥ = (X,Y), 
and 
e(Y, X)=Y3+X*°-Y A g(X,Y). 


A particular set of symmetric polynomials, called elementary, deserves special 
attention, and this is the object of the coming definition. 


Definition 16.5 For 0 < j <n, the j-th elementary symmetric polynomial in 
X1,..., Xp, denoted s; = sj(X1,..., Xn), is defined by 


(x Xy) 1, if 7 =0 
Sj(Xq,..., = . : ; 
J n ae eee pop ene ere ifl<j<n 


In the case n = 3, for example, we have 
So=l, sp =X+YV+Z, 99 =XVY+YVZ4+ XZ, 33 = XYZ. 


For an arbitrary natural number 7, it is not difficult to prove that s; is indeed 
symmetric. On the other hand, the actual importance of the elementary symmetric 
polynomials lies in the next proposition, which establishes Girard-Viéte formulas! 
between the coefficients and roots of a given polynomial. 


‘Francois Viéte and Albert Girard, French mathematicians of the sixteenth and seventeenth 
centuries, respectively. 
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Proposition 16.6 (Girard-Viéte) Let f(X) = a,X" +---+a,X +ap € K[X]\K 


have roots a, ...,Q@y, € K, repeated according to their multiplicities. For 1 < j < 
n, one has 
= jai 
Sj(Q1,..., Qn) = (—1)’ —. (16.3) 
an 
Proof By the sake of simplicity, we shall write s;(a@;) to denote s;(a,..., Qn). 


According to our hypotheses, the factorised form of the polynomial f is f(X) = 
An(X — a1)(X — a2)... (X — ay). Expanding the products, we obtain 


F(X) = an X" — ansy (aj) X"—! + aso (oj) X"? — +++ + an (—1)" sn (4). 


It now suffices to compare such an expression for f with that given in the 
statement of the proposition to obtain 


Gy S81 (Oj) = Apis. Gp 82 (OY = dyads 604 Gy(— 1)" SplOy) = ao: 


Oo 


In order to get the right feeling on what the previous proposition says and what 
it does not say, let a, 6 and y be the complex roots of the polynomial f(X) = 
X3 — 2X? + 1. Girard-Viéte relations give 


a+PBt+y=2, aB+ay+ By =0 and apy = —-1. 


However, it is worth noticing that such relations do not bring enough information 
for us to compute the values of a, 6 and y; indeed, if we try to solve the system of 
equations formed by them, we will simply obtain the polynomial equations f(a) = 
0, f(B) = Oand f(y) = 0. Actually, by multiplying both sides of the second 
equation by a, we get 


a°(B +y) +aBy = 0; 
in turn, substituting 6 + y by 2—a@ and aBy by —1 in this last equality, we arrive at 
2 = 
a“(2—a)—-1=0. 


In the notations of the former proposition, we shall refer to s;(a@1,...,0@n) € K 
as the j-th elementary symmetric sum of the roots of f. Whenever a1,..., Qn 
are understood and there is no danger of confusion with the elementary symmetric 
polynomial s; = s;(X1,..., Xn), we shall write simply s; to denote such a sum. 

We now present a series of examples that illustrate a variety of different situations 
to which one can profitably apply Girard-Viéte relations. 
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Example 16.7 Let f(X) = X" + ap_,X"~! + an_.X"~7 +--+» + a,X +g bea 
polynomial of real coefficients, such that ay < 2an—2. Prove that f has at least 
two complex, nonreal roots. 


Proof By contradiction, assume that ang < 2a,—2 but at least n — | of the roots 
of f are real, say a1,...,@,—1 € R. Since f and g(X) = (X — a1)... (X — ay_1) 
have real coefficients, we get from the division algorithm that 


S(X) = g(X)(X — an), 


for some a, € R. Therefore, all roots of f are real. 
Now, by applying Girard-Viéte relations, we obtain 


Hence 


which gives us the desired contradiction. Oo 


Example 16.8 (Croatia) Let a, b and c be nonzero real numbers satisfying a + b+ 
c = 0. Prove that 


O+P+e P+hP+0\ (a+b? +c 
ie oe ae 
Proof If f(X) = (X — a)(X — b)(X — c), then condition a +b+c = 0 gives 
f() = X3+5X —t, with s = ab+ac + bc and t = abc. Now, since f@ =0, 
we obtain a> = —sa + t and, analogously, b> = —sh + t and c? = —sc +t. By 
termwise addition of these relations and using condition a + b + c = 0 once more, 
we get 


O+h4+e0e= —s(a+b+c)+3t =3t. 
In order to deal with the sum a? + b° + c>, we start by multiplying both sides of 
the equalities a> = —sa +t, b> = —sb +t and c? = —sc +t respectively by a’, 
b* and c”; we then add them termwisely to obtain 


PLP POS HOLE tO) FG FeO): 


In the right hand side of the above expression, substitute a? + b? + c? = 3t and 
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a+b +c =(atb+c) —2(ab+ac + be) = —2s 
to obtain 
oP He j= —9-Sr+t(-29) = 5, 


The stated equality is, now, obvious. oO 


Example 16.9 (OCM) The roots of the polynomial f(X) = X? — 7X? + 14X — 6 
are the lengths of the sides of a triangle. Compute the value of its area. 


Solution Let a, b and c be the roots of f and A be the area of the triangle with 
sides a, b and c. Heron’s formula for the area of a triangle (cf. Chapter 7 of [9], for 
instance) gives 


A* = p(p —a)(p — b)(p — 0), 


where p is the semi-perimeter of the triangle. On the other hand, by Girard-Viete 
relations, we get 


a +b+c) : (a, b,c) i 
= => => — a, ; =>Tt, 
Dp 5 c 551 c 5 


so that 


Now, the factorised form of f gives 


FOSS =O Sil $6 a" = 7 + 6 


and, hence, 
iC Om lem) oa. 
= GG) -1(5) + 14(2) t= ; 
Thus, 
i 7 7 1 7 
ee er 16’ 
whence A = “2, oO 


4 
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Example 16.10 (Romania) Let a, b, c and d be real numbers such that 


a= V4-WVJ5-a, b=V44+V5-) 
c= V4-VJ5+candd=/44+VJ5+4d. 


Compute all possible values of the product abcd. 


Solution Note that a? = 4 — ./5 — a and, then, (a* — 4)? = 5 — a or, which is the 
same, a* — 8a2+a+11=0. Analogously, we have b* — 8b +b+11 = 0, so that 
a and b are roots of the polynomial 


(CO =k H=8k? EX + U1, 


Likewise, we conclude that c and d are roots of the polynomial X* — 8X? — X + 11, 
so that —c and —d are also roots of /. 

Therefore, all of a, b, —c and —d are roots of f, and if we know that they are 
pairwise distinct, we shall conclude that they are all of the roots of the polynomial 
jf; in turn, Girard-Viéte relations will give 


abcd = ab(—c)(—d) = 11. 


For what is left to do, assume we had a = b. This would give /4— /5 —a = 
V4+ /5 — a and, hence, a = 5. However, since 5 4 /4— J/5 —5, we actually 


have a ¥ b. Similarly, we prove that c # d. Finally, since —c, —d < 0 < a,b, we 
are finished. Oo 


Example 16.11 (Romania) Vf x1, x2,...,X, ate positive real numbers for which 
X1X2...X, = 1, prove that 


n 1 
err 
n-1+x; 


j=l 


Proof Let p(X) = (X +.x1)...(X + xn). Problem 3, page 393, gives 


p(x) aS i 
X+ Xj 


Px) > 
for x # —x1,..., —X,. Hence, we want to show that 
/ _ 
p(n-V) _ 


pin—1) ~ 
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For what is left to do, for 1 <i <n leta; = s;(x1,..., X,) be the i-th symmetric 
elementary sum of x1, ...,X,, and set ag = 1. Then, we have 


n-1 


n 
p(X) =) fajX"F and p(X) = D(a = j)ajyx" I", 
j=0 j=0 


so that it suffices to show the inequality 
n n=1 
Yi aja-14 = Vo @—/paja-yr 
j=0 j=0 


or, which is the same, 


n—-1 


Yo aj(i — IG — 17! + aay > nag(n — 1)"7! — ao(n — 1)". 
j=l 


Finally, we want to prove that 
n 
Yoaii-Da-yr tt >sa-yr 
j=l 


To this end, the inequality between the arithmetic and geometric means (cf. [8], for 
instance) furnishes 


n n 
aj = > Xiy + Xi; > (‘) G 


ip<-<ij 


Therefore, it is enough to show that 

"fn 
ye ( ‘a —)@-) > @-1"". 
jal V 


We claim that this last inequality is actually an equality. Indeed, the equality 


x*=(X-14+D"=)> (i) — yyr-k 


k=0 


give us, upon differentiation, 


n—-1 


nX"-! = (n—- a(t) (x — rk, 


k=0 
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Now, by evaluating the corresponding polynomial functions at x = n, it comes that 


n—-1| 
n” =S(n- o(t) (n — 1)"-*! 
k=0 
n—-1| ‘ 
=) In-l)-k- »i(?) (n— rk! 
k=0 
(0 n—k = n n—k-1 
= (;) (n — 1) x: v(f) (n — 1) 
k=0 k=0 
On the other hand, 
n=(n-1+)"=)> (je -1yr*, 
k=0 
so that 
n * n—-1 * n—-1 i 
y (;) (n—1)"* = xe (‘) g=1y Sik v(f) G1 
k=0 k=0 k=0 


After performing the obvious cancellations, we arrive at the equality 


n-1 


n Eh n n—k-1 _ 
("\in- 1) poe D(p Je 1) =, 


which is the same as 


ik v(f) (n—1)""*-1 =0, 
k=0 


At last, from the above we can write 


n n—0-1 “ n n—k-1 _ 
o-v(f)@=n +e v({Jo-p = 0, 


which is precisely the desired equality. Oo 
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1. Let f, g and h be nonzero polynomials in K[X,,..., Xn], such that f and g 
are symmetric and f = gh. Prove that h is also symmetric. 
2. * A polynomial f € K[X1,..., Xn] is said to be homogeneous of degree k if 


FGM otha SE OG 


for every t € K. Find, up to multiplication by a constant, all symmetric and 
homogeneous polynomials of degree 2 in R[X, Y, Z]. 
For the next two problems, S, stands for the set of all permutations of /, 
3. In R[X,,..., X,], n > 1, let be given a nonzero polynomial 


Fl Rivsag Ma Sa aeX a 


lisasls 


Foro € S,, define f° € R[X1,..., Xn] by 


f° (X1,..., Xn) = f (Koay, +--+, Xe) = Giy...a Xo) oh aye 


i,.stn 


In this respect, do the following items: 


(a) Prove that f°°’ = (f°)', for allo, t € Sy. 

(b) If f(X%1,..., Xn) = Theicjen(% — Xj), then f° = +f for every o ¢ 
Sn. 

(c) For f as in (b), write f° = sgn(o)f, with sgn(o) € {+1}. Show that 
sgn(o oT) = sgn(o)sgn(t), for allo, t € Sy. 

(d) Let Ay = {o € Sy; sgn(o) = 1} (we call the elements of A, even 
permutations). For 0, t € An, show that o o t and o~! also belong to 
An. 


(e) Given 1 < k <1 <n, we let ty denote the transposition corresponding 
to k and J, i.e., the permutation of J, such that 
i, ifiFk,l 
tm iGi)= 4/1, ifis=k 
k, ifi=l 
Prove that sgn (t%;) = —1, forall 1 < k <I] <n. 


(f) Show that |A,| = 4. 
4. For f € K[X1,..., Xn], let g € K[X),..., X;,] be defined by 


1 oO 
aoe Oe ie 


oESy 
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10. 


11. 


12. 


13. 


where f° is given as in the previous problem. Prove that g is a symmetric 
polynomial, called the symmetrization of f, and that g = f if f is itself 
symmetric. 


. (Brazil) It is known that the polynomial f(X) = X?+ pX-+q has three distinct 


real roots. Prove that p < 0. 


. (Croatia) Let a, b and c are pairwise distinct real numbers satisfying the system 


of equations 


a? = 3b? + 3c? — 25 
b? = 3c? + 3a? — 25 , 
c? = 3a? + 3b? — 25 


Compute all possible values of the product abc. 


. Given a,b,c € C find, as a function of a, b and c, the coefficients of a monic 


polynomial of degree 3 whose complex roots are the cubes of those of X? + 
aX*+bX +c. 


. Let p,g,r € R and assume that the complex roots of the polynomial X* + 


pX* + qX +r are three distinct and positive real numbers. Show that such 
roots are the lengths of the sides of a triangle if and only if p> —4pq + 8r > 0. 


. (Romania) Let a, b and c be nonzero complex numbers, such that 


1 1 1 
a+b+c=-+-+4+-=0. 
a boc 
If n is a positive integer, show that a” + b” +c" = Oif and only if 3 { n. 
(Hong Kong) Let a3, a4, ..., aio be real numbers, with ajo9 ~ 0. Prove that 


the polynomial 
F(X) = ayo X! + agg X? +--+ -+a3X74+ 3X7 42X41 


has at least one nonreal root. 

Consider all straight lines that meet the graph of the polynomial function 
f(x) = 2x4+ + 7x3 + 3x — 5 in four distinct points (x;, y;), for 1 < i < 4. 
Prove that the number 4% +x2+x3-+.2x4) does not depend on the chosen line, 
and compute its value. 

(Moldavia) In the cartesian plane, a circle intersects the hyperbola of equation 
xy = | in four distinct points. Prove that the product of the abscissas of the 
intersection point is always equal to 1. 

(Canada—adapted) The complex numbers a, b and c are the roots of X? — X?— 
xX-1. 


(a) Prove that a, b and c are pairwise distinct. 
(b) If, for each n € N, we set 
a" —b" bh —c"— ca" 


S.= + + 5 
a a—b b-—c c-—a 


show that Sx4.3 = Sx42 + Sea1 + Sx for every k EN. 
(c) Conclude that S, € Z, for every n € N. 
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14. (BMO—adapted) For real numbers x and y such that xy 4 —1, define 


l+x+y 
xX*eyY= 
l+xy 
(a) Given n > 2 positive reals x1, x2, ..., Xn, show that 


SE S31 ES 
L+sg+sqgt---+5)’ 


x1 * (XQ * (+++ *Xn)--+) 


where i and p stand for the largest odd and even natural numbers less than 
or equal to n, respectively, and s; denotes the j-th elementary symmetric 
sum of x1, X2,...,Xp- 

(b) If f(X) = (X +41) (X +22)... (X +X), show that 


Xp * (42 * (+ KX) oD = fe) 
if fD+C)"fED | 


15. Given an integer n > 1, find all real solutions of the system of equations 


Xp +x2++++ +X, =n 


xi tapte-taean 


xp tagte+xan 


16. (IMO—shortlist) Let f(X) = X” +ay_1X"~!+---+a,X +1 be a polynomial 
of nonnegative real coefficients and real roots. Prove that f(x) > (x + 1)” for 
every real number x > 0. 

17. (ireland) Find all polynomials f(X) = a,X" +---+a,X + apo satisfying the 
following conditions: 


(a) aj € {-1, l} forO <j <n. 
(b) All roots of f are real. 


16.3. Newton’s Theorem 


Yet with respect to symmetric polynomials, note that f(X, Y, Z) = X*+Y4+4 Zz? 
is symmetric, albeit not one of the symmetric polynomials in X, Y, Z which we 
called elementary. Nevertheless, letting 5; = s,(X, Y, Z), so = s2(X, Y, Z) and 
53 = 53(X, Y, Z), we can write 
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FOL Ty Sh ey = OY Xr) 
= [x 4+Y4Z)?-2XY+XZ+ vz)] 
2 [uxy 4XZ+4YZ)2 —2XYZ(X+Y 4 Z)| 

= (s? — 259)” — 2(s> — 25153) 

= st _ Ast so + 282 + 45153 

= g (51, $2, $3), 
where 

RY Z) = OSA y Oy? ay: 


Thus, at least in this particular case, we were capable of expressing the symmetric 
polynomial f(X,Y,Z) = X*+ ¥*+ + Z* as a polynomial in the elementary 
symmetric polynomials in X, Y, Z. 

Actually, the coming result, usually attributed to Newton and known in math- 
ematical literature as the fundamental theorem on symmetric polynomials, 
assures that this was no accident. In what follows, K includes the possibility K = Z. 


Theorem 16.12 (Newton) /f f = f(X1,..., Xn) € KLX1,..., Xn] is symmetric, 
then there exists g € K[X,,..., X,] such that 


F(X, .--, Xn) = BS, --- Sn), 


where S1,...,8, € K[X1,..., Xn] are the elementary symmetric polynomials in 
DG eee, Con 


For the proof of this theorem, we shall need to introduce some preliminary 
concepts. Firstly, let us order the monomials in K[X1,..., X,] in the following 
way: given two distinct n-tuples (i1,...,i,) and (j1,..., jn) of nonnegative 
integers, and monomials avi X eee and bij) Xt! : Pn ek in K[X,..., Xn], we 
define 


aX! a xin ~< bij) x! xn 


Oi Wate (16.4) 
Se a oe cae 
Ik < Jk 


In this case, we say that bij Xf ... XJ" is greater that aq) X'} ree x 

In general, we shall refer to the ordering above as the lexicographic order in 
the monomials of K[X,,..., X;,]. In particular, for f € K[X,,..., X,] \ {0}, we 
define its leading term as the maximum monomial with respect to the lexicographic 
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order. Note also that in K[X] one has 1 < X < X? < ---, so that the above notion 
of leading term of a polynomial in several indeterminates coincides with the usual 
notion in one indeterminate, which is given by the degree of the monomial. 


Example 16.13 If f = st 1 she with s; € K[X,..., X,] standing for the i-th 
elementary symmetric polynomial, then f has leading term 


one ‘es k k 
ee +kn x +kn ar 4 n—1t mie (16.5) 


n—1 


Indeed, since 


p=(Dx) (Day Orn 


i<j 
the definition of lexicographic order guarantees that its leading term is 
k 
Xy) (X10) 1 XoXs)” ... K1... Xn), 


which is exactly (16.5). 


With the concept of lexicographic order at our disposal, the key to the proof of 
Newton’s theorem lies in the coming auxiliary result. 


Lemma 16.14 Jf f © K[X1,..., Xn] is symmetric and aa) X4' ... Xe" is its 
leading term, then 


Oy So Sy. 


Proof If a is the largest exponent appearing in some monomial of f, then, since f 
is symmetric, it contains a monomial with x If agi) X}' __X! is a monomial in 
f such that i; < a, it follows from the deaniion of iexisopraphic order that such 
a monomial is not the leading one. Hence, the leading term of f must contain X rt : 

Now, among all of the monomials in f containing X{', choose one with 
maximum exponent in one of the indeterminates X2,..., Xn, say exponent a. 
By invoking again the symmetry of f, there exists such a monomial contain- 
ing x, ae a3 moreover, from the choice of aj we have aj > ap. In turn, if 
aX"! x? ... X!" is a monomial in f such that ir < a, then (again from the 
definition of lexicographic order) such a monomial is not the leading one, which 
shows that the leading term contains a Xs Finally, by repeating the above 
argument more n — 2 times, we obtain the desired result. oO 


We can finally present the proof of Newton’s theorem. 


Proof of Theorem 16.12 Take f € K[X1,..., Xn] symmetric, with leading term 
di Xy ee. 6 . By the previous lemma, we have a; > --- > dy. On the other 
hand, Brains 16.13 guarantees that the symmetric polynomial 
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2(Xq,..., Xn) = iss sy = eer a 
also has Aig X)' ...X"" as its leading term, so that the symmetric polynomial 
f — g has leading term ace) X¢! XP" with apy X4! xen Gig Xy eka 
in the lexicographic order. However, since K[X1,..., X,] contains only a finite 
number of monomials which are less than aX, ... Xn" with respect to the 


lexicographic order, a finite number of repetitions of the algorithm above gives us 
f -— g = I(s1,...,5n), for some polynomial / € K[X,,..., X;]. Thus, the same 
happens with f. Oo 


It is worth noticing that Newton’s theorem is mainly an existence result. Indeed, it 
is possible to prove that, for generic symmetric polynomials, the algorithm described 
in the proof of Theorem 16.12 does not finish in polynomial time (i.e., even with the 
aid of a good computer, we will not be able to use the algorithm to actually express, 
in finite time, a generic symmetric polynomial in n indeterminates as a polynomial 
in the corresponding elementary symmetric polynomials). Nevertheless, the coming 
example shows that the mere existence guaranteed by Newton’s theorem can be 
quite useful. For another interesting application, see Sect. 20.1. 


Example 16.15 (Miklés Schweitzer) Let f € Z[X] be a nonconstant polynomial 
and w = cis an where n > | is an integer. Prove that 


f@)fo)...f@"") €Z. 
Proof Consider the polynomial g € Z[X1,..., Xn—1] given by 
g(X1,...,Xn-1) = F(X) f (X2).-- f(Xn-1)- 
If o is a permutation of [,-1, then 
{o(1),0(2),...,0(7— 1} = {1,2,...,n— 1}, 
so that 


8(Xo(1),+++s Xo~n—1)) = f (Xo) f (Xo) .-- f(Xo(n-1)) 
= f(X1) f(X2)... f (Xn-1) 
=> g(X, earn Xn): 


Thus, g is symmetric and Newton’s theorem assures the existence of a polynomial 
heZ[X1,..., Xy—1] such that g(X1,..., Xn—-1) = A(s1,.--, Sn—1), Le., 


f (X41) f (X2)... f(Kn-1) = hs, -- 5 Sn—-1), 


with s; standing for the j-th elementary symmetric polynomial in X),..., Xn-1. 
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Substituting X; by «w/ in the equality above, we obtain 


f(@)f(w’)... f(@""!) = h(si(@,..., 071), ..., 5n-1(@,..., 04). 
Therefore, in order to show that f(w) f(w*)... f(w"~!) € Z, it suffices to show 


that sj(@,...,@""!) eZ forl <j <n-1. 
For what is left to do, just note that 


Mo Ss HIE Sw Ho). Ko) 


=(X-1(X" 14 X72 4---4+ X40, 
so that 


A ee Se RT aK aM aw). ew 
n—-1| 
= Yi! sj, O g.30 XR 
j=0 


Hence, we have 
S)(@,@ a" * = (—1)/ EZ 
jh@, goers ; 


as wished. oO 


The discussion preceding the former example hints to the following fact: if we 
wish to effectively express a given symmetric polynomial as a polynomial in the 
elementary symmetric polynomials in the same number of indeterminates, we shall 
usually have to rely on ad hoc arguments. In this sense, we end this section by 
discussing a relevant example, for which we shall need the following consequence of 
Horner-Rufinni’s algorithm. Identities (16.6) below are known as Horner-Ruffini’s 
identities. 


Lemma 16.16 (Horner-Rufinni) Let 
F(X) = XM 5 XP E pt (O15 1X + (- 1)" 
be a nonconstant polynomial over C. If z is a complex root of f and 
e(X) =X EDX" 4 + bn 


is the quotient upon division of f (X) by X — z, then 
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bh =z-S} 


by = 2 — s1z+ 82 


(16.6) 
bag = Zt) = gy zk tee + (- DH eg 


Dn—-1 = zn-l _ syzt-? hee (-1)""! 5,-4 
Proof In the stated notations, the recurrence formulas (15.1) can be written as 


bh =zZ-S]) 
by = zbi + 82 


bes = zby + (1) 5441 


bn-1 = Zbn-2 + (— eae 
By solving the linear system above successively for b2, ..., by-1, we obtain, one by 
one, the stated relations. oO 
The relations contained in the coming result will give us recurrences that express 
the symmetric polynomial 


ox(X1, Xo,..., Xn) = XE +XFt--- + XE 


as a polynomial in the elementary symmetric polynomials in Xj, X2, ..., Xn. 
The formulas of items (a) and (b) will be respectively referred to as the first and 
second Jacobi’s identities.” For a proof of them using generating functions, see 
Example 21.19. 


Proposition 16.17 (Jacobi) For z1,..., Zn € C, ifs; = sj(z1,..., Zn) denotes the 
i-th elementary symmetric sum of z1, ..., Zn and ox = z* 


{eee Zz then: 
(Q) Ont+k = ai 1)! sjonge-j fork = 1. 
(db) sepi = eal (HDI sep joj forl<k<n-l. 
Proof 
(a) Let 


{OSC -2) ee ag ates. 067) 


2Carl Gustav Jakob Jacobi, German mathematician of the nineteenth century. 


414 16 Relations Between Roots and Coefficients 


Since z; is aroot of f, we have 
zp = siz; 2 aa (=1)"! sp—123 + (—1)"s, =0 


for 1 <i <n, and thus 


ge _ ic + er + iy ga + (—1)"s,z* _ 0. 


By adding the equalities above for 1 <i <n, we obtain 


—1 
Ont+k — S1Ontk—-1 Fee + (-1)” Sn—1Ok41 + (—1)"spox = 0, 


an equality that is equivalent to the stated one. 
(b) It follows from (16.7) that 


F(X) =X"! = = Ds X"F7 He (ED nt 
On the other hand, for z € C \ {z1,..., Zn}, Problem 3, page 393 gives 


p= tO 4.4 50 
ee Al ae 


If f; € CLX] is such that f(X) = (X — z;) fj (X), say 
Fi(X) = XP! 4 DX"? HH byt, j, 


we get 


f'@ oe yer Se Oe 
j=l 


j=l 


mnt! + (Sooy)er? ++ (br). 
j=l 


However, since the equality above is true for every z € C \ {z1,... 


Corollary 15.10 gives 


f'&) = nxn! +S /)xn2 4 +(D he 1.j): 


(16.8) 


Zn}, 


(16.9) 


Finally, by equating the corresponding coefficients in (16.8) and (16.9) and 


substituting identities (16.6), we obtain 
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n 
(Dn —k — Dow = Do bes, 
j=l 


=e - a tet CD ey) 


k+l 
= On41 — 810K Fee + (HDT nse, 
so that 


(K+ Lsep1 = 5k01 — 5e-102 ++°* + (DE soog st. 


The coming result is also due to Jacobi. 


Corollary 16.18 (Jacobi) For each integer k > 1, let s; denote the k-th elementary 
symmetric polynomial in X1,..., Xn, and o, = x feee fp x Then: 


(A) Ontk = War 1)I | sjonte-j fork = 1. 
(b) Ska = + it (- DI sep jo; forl<k<n-l. 


Proof The previous proposition assures that both sides of (a) and (b) are equal upon 
evaluation in arbitrary zj,...,Z, € C. Since C is an infinite set, Proposition 16.1 
guarantees the equality of the corresponding polynomials. Oo 


Problems: Sect. 16.3 


1. If f € Z[X] is a monic polynomial of degree n > 1 and complex roots 


Z1,+++, Zn, prove that a Sa zi € Z for every integer k > 1. 
2. Ifaj,..., an, b1,..., by are complex numbers such that 
Gp Ais ae mbt eas ee 
for 1 < k <n, show that {q,,...,a,} = {b1,..., Dy}. 
3. (Japan—adapted) Let n,k € N, with 2 < k <n, and qj,..., a, real numbers 
such that 
ayte +a = 
ay ee ns 


k k 
ajte:+a=n 


If p(X) = (X +a,)...(X +. ax), prove that p(X) = er ("xd 
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4. Letm € N, mw = cos at +i sin at and z1,..., Z, be complex numbers such that 


ff (X) = (X — 21) (X — 22)... (X — Zn) € ZX]. 


(a) If g(X) = i = f (w/ X), prove that 
g(X) = (xX — zy")... (X™ — zi"). 


(b) Show that the polynomial g also has integer coefficients. 
(c) Conclude that if z € C is a root of a nonzero polynomial with integer 
coefficients, then so is z”, for every m € N. 


5. (Brazil—adapted) 


(a) Given n € N, prove that there is at most a finite number of monic 
polynomials of degree n and integer coefficients, such that all of their 
complex roots have modulus 1. 

(b) Let f € Z[X] \ Z be a monic polynomial all of whose complex roots have 
modulus 1. Prove that all of them are roots of unity. 


Chapter 17 ®) 
Polynomials Over R spooks 


This chapter revisits, for real polynomials and departing from the fundamen- 
tal theorem of Algebra, some classical theorems of Calculus. As applications 
of them, we shall prove Newton’s inequalities, which generalizes the classical 
inequality between the arithmetic and geometric means of n positive real num- 
bers, and Descartes’ rule, which relates the number of positive roots of a real 
polynomial with the number of changes of sign in the sequence of its nonzero 
coefficients. 


17.1 Some Calculus Theorems 


In this section we establish, for real polynomials, some classical results on continu- 
ity and differentiability. For a more general approach, we refer the reader to [3] or 
[8], for instance. 

Our first result provides a sufficient way for the existence of real roots in a given 
interval. For the proof of it, we shall need the following auxiliary result, which 
appeared as Problem 6, page 386. Nevertheless, this time we give a different proof. 


Lemma 17.1 If f € R[X] \ {0} is mOme and has no real roots, then there exist 
polynomials g,h € R[X] such that f = g* +h. Inparticular, f (x) > 0 for every 
xeR. 


Proof Problem 4, page 386, assures the existence of complex nonreal numbers 
Z1,+++, Zk Such that 


(X= To — zj)(X —Z)). 


J= 
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Now, if Zj=ajt+ ibj, with aj, bj € R, then 


(Xx - Zj (Xx = Zj) = (xX -— aj — ibj )(X = 4) F ib;) 
= (X — aj)’ — (bj 
= (X - aj)? +d%, 
the sum of the squares of two real polynomials (one of them being constant). 


It now suffices to apply several times the analogous of Euler’s identity (12.8) for 
polynomials: for g1, g2, 41, h2 € RLX], we have 


(gf +A7)(g5 +3) = (gig2 + hike)” + (giha — gah). 
For what is left to do, it follows from the first part that 
f(x) = g(a)” + h(a)? = 0, 
for every x € R. However, since f has no real roots, the inequality above must be 


strict, for every x € R. oO 


The coming result is nothing but the intermediate value theorem for polynomials, 
being known in mathematical literature as Bolzano’s theorem. ! 


Theorem 17.2 (Bolzano) Jf f € R[X] anda < bare real numbers such that 
t(a@a f(b) < 0, then there exists c € (a, b) for which f (c) = 0. 


Proof Assuming, without loss of generality, that f is monic and f(a) < 0 < f(b), 
the last part of the previous lemma assures that f has at least one real root. Let 
a, < +--+ < a be the real roots of f, repeated according to their multiplicities. If 
g € R[X] is such that 


F(X) = g(X)(X — ay)... (X — ay), 


then g is monic and has no real roots, so that (again by the previous lemma) g(x) > 
0 for every x € R. 

By contradiction, suppose that no real root of f belongs to the interval (a, b), 
and separately consider the three following cases: 


(i) ag < a: we have 


f(@ = g(a\(a— ay)... (a— a) > 0, 


which is a contradiction. 


‘Bernhard Bolzano, German mathematician of the nineteenth century. 
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(ii) b < aj: it follows from f(b) > 0 that 
g(b)(b —ay)...(b— ax) > 0 


and, hence, k is even (for g(b) > O and b—a; < Oforl <i < k). On the 
other hand, since f(a) < 0, we get 


g(a)(a — a)...(a—ax) < 0, 


so that k is odd (for g(a) > O anda — a; < 0 for 1 <i < k). We have, thus, 
reached a contradiction. 

(iii) ay < a < b < aj41, for some | </ < k: we can arrive at a contradiction in a 
way similar to that of item (ii). For instance, 


Cat O20) G = Sie 6p Oa Gee) 
>0 >0 <0 


implies k — / even. 
oO 


Example 17.3 (Moldavia) Let f, g € R[X] be two given polynomials, each of 
which possessing at least one real root. 


(a) Prove that there exists a € R such that f (a)? = g(a). 

(b) If f4 + X + g(X)*) = g(1 + X + f(X)’), show that f = g. 

Proof 

(a) Let aw and 6 be real roots of f and g, respectively. We can assume, without any 


loss of generality, thata < 6B. If g(a) = 0 or f(6) = O, there is nothing to do. 
Otherwise, 


f(a)? — g(a)? = —g(a)? <0 and f(f)* — g(B)? = f(B)? > 0, 


so that Bolzano’s theorem, applied to the polynomial f(X)? — g(X)*, guaran- 
tees the existence of a € (a, 8) such that f(a)? _ g(a)* = 0. 

(b) Choose a € R as in item (a) and define a sequence (uy),>1 by letting uo = a 
and, for each integern > 1, Uno) = Ll+uynt+ g(Un)>. It follows from (a) that 
f(u1) = g(u1). Now assume, as induction hypothesis, that f(ux) = g(ux) for 
some integer k > 1. Then, 


f (wep) = fA + ug + gun’) 
= g(1+ue + f(ux)”) 
= g(1 tux + g(ug)) = g(ueqi)- 
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Thus, we have f (un) = g(uy) for every integer n > 1, with 
Untl = 1+un + g(un)* > l+tun > un 


for each such n > 1. Therefore, Corollary 15.10 gives f = g. 
oO 


In what comes next, we establish, for polynomials, Lagrange’s mean value 
theorem. 


Theorem 17.4 (Lagrange) Let f € R[X] \ {0} anda < b be given real numbers. 
Then, there exists a < c < b such that 


fie = LO=f@. 

b-a 
Proof We may assume that f # 0, and first show that if f(a) = f(b) = 0, then 
there exists a < c < b for which f’(c) = 0. Without any loss of generality, we 
can assume that f has no other roots in the interval (a, b). Indeed, since f 0, 
Corollary 15.9 assures that f has at most a finite number of roots in (a, b); then, if 
these are a, < a2 <--- < ax, it suffices to consider a, in place of b. 

If there exist a < c < d < b such that f(c)f(d) < 0, Bolzano’s theorem 
guarantees the existence of a root of f in the interval (a,b) which is an absurd. 
Hence, f has constant sign along the interval (a,b). Assume, without loss of 
generality, that f(x) > O for x € (a,b), and let k and / be the multiplicities of 
a and b as roots of f, respectively. Then, there exists g € R[X] such that 


f (X) = (X —a)(X — b)'g(X), 
with g(a), g(b) £ 0. Now, 
a<c<b=> fc) >0=> (c—a)*(c—b)'g(¢-) > 0S (-1)'g(c) > 0. 


We look at the case of an even / (that of odd / can be dealt with in an analogous 
way), so that g(c) > 0 for every c € (a,b). If g(a) < 0, then Bolzano’s theorem 
would assure the existence of a < d < ae such that g(d) = 0; in turn, this would 
give f(d) = 0, which is impossible. Therefore, g(a) > O and, likewise, g(b) > 0. 
We then have 


f'(X) = k(K — a) (X — by g(X) + 1K — a)k(X — by“! g(x) 
+ (X —a)*(X — b)'g'(X) 
= (X — a)¥!(x — by A(X), 
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with 
h(X) = (k(X — b) + 1(X — a))g(X) + (X — a)(X — b)g'(X). 


It now suffices to show that there exists c € (a, b) such that h(c) = 0. However, 
since 


h(a)h(b) = —kl(a — b)*g(a)g(b) < 0, 


an application of Bolzano’s theorem does the job. 
For the general case, let 


bs 
AiO = f(X) — fa) (2 LO) «x ay, 


b-— 


which is also a polynomial. Since f;(a) = f|(b) = 0, what we did above guarantees 
the existence of c € (a, b) such that fi (c) = 0. Finally, it is enough to note that 


flay = fray- (FORO) 


b-a 
a] 


The case f(a) = f(b) = 0 in the previous result preceded the general version 
due to Lagrange. For this reason, it is usually referred to as R6lle’s theorem, in 
honor of his discoverer, the French mathematician of the seventeenth century Michel 
Rolle. 


Corollary 17.5 (Rolle) Jf f € R[X] anda < bare real numbers such that f (a) = 
f(b) = 0, then there exists a < c < b for which f'(c) = 0. 


The coming example illustrates a typical use of Rdlle’s theorem. 


Example 17.6 Let f (X) = ag+a,X+--++ay_,X"~!+a,X" be areal polynomial 
such that 


Gn-1 an 


a a 
ee 0. 


i'2 “i” Re 


Prove that f has at least one root in the open interval (0, 1). 


Proof It is clear that f = g’, where g is the real polynomial 


g(X) = apX + SX? pp Sh 4 OA xm 
n 


n+1 


Since g(0) = g(1) = 0, Rolle’s theorem assures the existence of a € (0, 1) such 
that f(a) = g/(a) = 0. Oo 
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An important consequence of Lagrange’s mean value theorem is the study of the 
first variation (i.e., of the growth or decay) of real polynomials. This is the subject 
of the coming corollary. 


Corollary 17.7 Let f € R[X] \ {0} and I C R be an interval. 


(a) If f(x) > Ofor every x € I, then f increases in I. 
(b) If f(x) < Ofor every x € I, then f decreases in I. 


Proof Let us prove item (a), the proof of (b) being entirely analogous. For a < b in 
I, the mean value theorem furnishes c € (a, b) (and, hence, c € J) such that 


f(b) — f@ 


a = f'(c) > 0. 


In particular, f(b) > f(a). oO 


The coming example uses a little more Calculus than we discussed above. As 
before, we refer the reader to [3] or [8] for the necessary background. 


Example 17.8 (Sweden) Prove that, for every natural number n, the real polynomial 
i 2 1 n 
14+X+—X*+4+---+—-—xX 
2! n! 


has at most one real root. 


Proof Letting fp =1+X+ x +--+ %° we shall show that (i) Jn has no real 


n!? 
roots if n is even, and (ii) f,; has exactly one real root if n is odd. 

(i) Assuming n even, we have limjx|++400 fn(x) = +00. Hence, Weierstrass’ 
theorem (Theorem 8.26 of [8]) assures the existence of x9 € R such that f, 
assumes its minimum value at x9. Suppose, for the sake of contradiction, that 
fn(xo) < 0. Then, Problem 3 gives f/ (xo) = 0, and then 


re 

00> fn(x0) = Ff, (0) + ai = Pr i 
Since n is even, the inequality above successively gives x9 = 0 and f,(0) = 0, 
which is an absurd. 

(ii) Assuming n odd, Problem 5, page 386, gives an odd number of real roots for 
fn. On the other hand, since f/ = fn—1 > 0 (for — 1 is even), Example 15.33 
shows that f, has no multiple roots. By the sake of contradiction, suppose that 
Jn has at least two distinct real roots, say a < b. Then, Rélle’s theorem gives 
a € (a,b) such that f/(w) = 0. This way, fn—1(@) = 0, and this contradicts 
(i), for n — | is even. 

oO 
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Problems: Sect. 17.1 


1. If f € Z[X]\ Zand m e€ N is such that m > 1+ Re(z), for every complex root 
z of f, prove that | f(m)| > 1. 

2. * Prove, for real polynomials and with the methods of this section, the sign 
preserving lemma: if f € R[X] is such that f(a) > 0 for some a € R, then 
there exists r > 0 such that f(x) > 0 for every x € (a—r,a+r). 

3. * Given f € R[X] \ R, leta € R andr > 0 be such that 


f(@ =min{ f(x); x € (a-ratnr)}. 


Prove that f’(a) = 0. 

4. Let f(X) = X° — 2X* +2. Prove that f has exactly three real roots. 

5. We are given a real nonconstant polynomial f with positive leading coefficient. 
Prove that there exists ng € N for which 


u>v>no=> flu) > flv) >0. 
6. Prove that there does not exist a polynomial f € Z[X] such that f(n) is prime 
for every nonnegative integer n. 
7. (Leningrad) Decide whether there exist four distinct real numbers a, b, c and d 
such that, for any two of them, say x and y, we have 


OL Py tee tay? + yl = 1. 


8. The positive reals aj, a2, a3, aq are such that a) < a2 < a3 < ag and 
ajaza3a4 = 1. If A is areal root of 


reece aa))x- (D2). 


i=1 1<i<j<4 i 


Ir 
_ 


prove that A > ap. 

9, (Leningrad) Let a, b, c,d and e be real numbers such that the polynomial aX*-+ 
(c—b)X + (e—d) = Ohas a real root greater than |. Prove that the polynomial 
aX* + bX? 4+ cx? +dX +e = Ohasat least one real root. 


10. (Austrian-Polish) Given any real numbers a), ..., Gy), prove that 
ys Hz 
i,j=l bad 
with equality if and only if aj) =--- =a, =0. 


11. Let f(X) = X? — 3X +1. Compute the number of distinct real roots of the 
polynomial f(f(X)). 
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12. 


13. 


14. 


15. 
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(Sweden) Let f € R[X] be a polynomial of degree n. If f(x) > 0 for every 
x € R, show that 


FO+SFO+ SF’ +--+ f™@) = 0, 


also for every x € R. 
(Soviet Union) Two mathematicians A and B play the following game: it is 
given a polynomial of even degree greater than 4, 


F(X) = X™ + ayy 1X 4 ta X +1, 


with aj,...,@2,—1 real numbers to be chosen. The players, starting from A, 
alternate themselves in choosing the coefficients of f, until it is completely 
determined. At the end, A wins if none of the roots of f is real, and B wins 
otherwise. Find a winning strategy for B. 

At the beginning of a class, the teacher wrote a third degree real polynomial in 
the blackboard. Then, the students alternated themselves, each one performing 
one of the following operations with the polynomial written in the blackboard 
(which was then substituted by a new one): 


(a) Add, to the written polynomial, its derivative. 
(b) Subtract, to the written polynomial, its derivative. 


At the end of the class, the polynomial initially written by the teacher 
reappeared in the blackboard. Prove that at least one of the students made a 
mistake. 


(OIM, IMO—adapted) We are given real numbers aj, a2, ..., Gd, with O < 
a, <a). <-+- <a). If f: R\ {-a,...,—-a,} > Ris given by 
n a; 
j 
x)= ; 
fa=) 5 7 


j=l 


prove that the set {x € R; f(x) = 1} consists of the union of n bounded, 
pairwise disjoint intervals, with sum of lengths equal to aj + az + +--+ dn. 


17.2 Newton’s Inequalities 


In this section, we present a set of inequalities that refine the inequality between 
the arithmetic and geometric means of a finite set of positive real numbers (cf. 
Section 5.1 of [8], for instance—actually, there we give four different proofs of 
such an inequality). To this end, we first need the following auxiliary result. 
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Lemma 17.9 /f f € R[X] \ {0} has k real roots, counted with multiplicities, then 
f' has at least k — \ real roots, also counted with multiplicities. In particular, if all 
roots of f are real, then so are all of the roots of f’. 


Proof For the first part, we can assume that k > 1. Let aj < --- < q be the 
distinct real roots of f, with multiplicities respectively equal to m1, ..., mj , so that 
mi +---+m;, =k. We have already seen, in Proposition 15.31 that a; is a root of 
f’ with multiplicity m; — 1. On the other hand, Rélle’s theorem assures that f’ has 
at least one additional root between a; and aj+1; since there are / — 1 intervals of 
the form (a;, aj+1), we thus count at least 


(my —1) +--+0m-D)+d-Y)=k-1 


real roots for f’. 

For the second part, assume that f has degree n and n real roots, counted with 
multiplicities. Then, f’ has degree n — 1 and, by the first part, at least n — 1 real 
roots, also counted with multiplicities. Therefore, all roots of f’ are real too. Oo 


For what comes next, given > 1| real numbers a), ao, ..., Gd, and an index 
1 < j <n, we shall write S;(a,) to denote the j-th elementary symmetric sum 
of a}, ...,4,, and H;(a;) to denote the arithmetic mean of the () summands that 
compose Sj (ax), i.€., , 


Hj (ax) = Sj (ax) aa. > aj, ..-aj,. 


(j) ip<...<lj ‘ 


The coming result is usually credited to Newton, and inequalities (17.1) are 
usually referred to as Newton’s inequalities. For the sake of notation, we let 
Ho = 1. 

Theorem 17.10 (Newton) Letn > 1 be an integer and aj, a2, ..., An be given real 
numbers. If H\(ax), H2(ag), ..., Hj+i (ax) # O for some 1 < j <n, then 


Hy (ax)? > Hj-1 (ax) Hj+1 (ak), (17.1) 


with equality if and only if aj = az =... = an. 

Proof Let us make induction on n. For n = 2, we want to show that HP (a, an) => 
H(a1, a2) H2(a1, az) or, which is the same, (agey > a,az. But this is equivalent 
to (a, — ay)? > 0, which turns into an equality if and only if a; = ap. 

Assume, by induction hypothesis, that for any n — | given real numbers }j,..., 
by—1 such that Hy (by), H2(bx), ..., Hj+1(be) F 0 for some 1 < j < n—1, we 
have H; (by)? > Aj—-1(bg) Hj+1 (bx), with equality if and only if b} =... = dDy_1. 

We now consider n > 3 real numbers aj, a2, ..., dy, and let 


f(X) = (X+a1)...(X +an). 
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The previous lemma assures the existence of n — | real numbers bj, ..., b»—1 such 
that be € (ax, 4x41) for 1 < k <nand 


f'(X) =n(X +)1)...(X + bp-1). 
In particular, 


n—- 


1 / n—1 n—1 n—2 1 
=i (X) = Xx" + I ) mucirx oe asc a 1) Bib (17.2) 


On the other hand, the equality 
=< [n 
fH= > ( ‘A, (ay) x" 
=e 
i= 
furnishes 


n-1 


f(%) = Voa- a(") Hy (ax)X" J! 
j=0 " 


and comparing coefficients with (17.2) we obtain, for0O < j <n —1, 
Hj(a1,..-,4n) = Hj(b1, ..., On-1). 


Now, assume that Hy (ax), H2(ax),..., Hj41 (ax) 4 O for some 1 < j <n —2. 
Then, Hj (bx), H2(bx), ---, Hj+1(be) F O and, by induction hypothesis, for 1 < 
j <n—2we have 


H; (ag) = Hj (be) = Hj-1 (be) Hj41 be) = Hyj-1 (ae) Hj+1 an). (17.3) 


For the equality, if H; (ax) = Hj_\(ax)Hj+1(ax) for some 1 < j < n—2, 


then (17.3) gives H? (by) = Hj_-1(by)Hj+1(bx). Therefore, the condition for 
equality in the induction hypothesis furnishes b} = ... = by_1, and from this it 
is almost immediate (cf. Problem 1) to show that aj =... = dp. 

We are left to proving that, if Hy (az), Ho(ax),..., H,(ax) # O, then 


Te, (ax) => An—2(ax) An (ax), 


with equality if and only if aj = ... = a,. We shall prove that this is true provided 
we merely have H, (ax) 4 0. Indeed, in what concerns the desired inequality, it is 
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the same as proving that 


-——_ 
———~ 
3 
[| 3 
fas 
Seca 
| 
8 
8 
S 
3 
| 
i) 
IV 


—i| n =] 
n se, ae n 
= (,"5) Dodi Gi Gj dn (") orate 


Letting P = aj ...dn, we observe that to prove the above inequality is the same as 
to establish that 


or 


If we set a; = ;, we wish to show that 


2 
n 
(n — 1) (>>) > 2n aw, 
i=l i<j 
Denoting S = (n— 1) (77, aj)” — 2n )7; <; #i%j, We compute 


n 
S=(n- 1) Soo? + 2(n — 1) S0 cyjee; —2n So aia; 
i=1 


i<j i<j 


n 
=(n— 1) So a7 — 2) > ajay; 
i=1 


i<j 
=) @-aj)* > 0. 
i<j 
Now, it is clear that equality holds if and only if aj = ... = ay, i.e., if and only if 
ay =...=Gy. oO 


The coming corollary, due to the Scottish mathematician of the eighteenth 
century Colin McLaurin, embodies the promised refinement of the inequality 
between the arithmetic and geometric means. Inequalities (17.4) are then known 
as McLaurin inequalities. 
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Corollary 17.11 (McLaurin) Forn > 1 positive real numbers ay, a2, ..., An, We 
have 

Hy (ax) = V Ho(ax) = VH3(ax) =... = Wn (an), (17.4) 
with equality at each of the above inequalities if and only if ay = ... = dp. 


Proof For the sake of simplicity, write H; in place of H; (ax). Newton’s inequalities 
give us 


H? > HoH) = Hy and H3 > HH. 


Therefore, H} > fees > HH, so that He > Hy. In turn, this last inequality, 
together with the second inequality above, gives 


H3 > HH; > H,!° Hy = Hy” 
and, thus, 
H, > Hy! > H,”. 
Assume we had already proved that 


1/(k-1) 
1 


M2 Hs... HY) > A! 


for some positive integer k < n. It follows from Newton’s inequalities, together 
with the last inequality above, that 


k-1 


He > Ay-1 Hes > H,*® Aes, (17.5) 
a ; 2-(44 a ee 1/k 1/(k+1) 
which in turn gives us H, > Ag+ or, which is the same, H,"" = Ay . 
k-1 
For the equality, suppose we have Hy!* = Bos Then, He = HA,” Hee, 


and (17.5) shows that rts = Ay_—| Hy+1. Therefore, it comes from the condition for 
equality in Newton’s inequalities that aj =... = dy. Oo 


Problems: Sect. 17.2 


1. * Let f € R[X] \ {0} be a polynomial of degree n with n real roots, counted with 
their multiplicities. If f’ has n — 1 equal roots, show that f has n equal roots. 

2. We are given nonzero real numbers a, b, c, d such that 4b? < 9ac. Show that the 
polynomial X+ + aX? + bX? + cX +d has at least two nonreal roots. 
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3. Let a, b, c, d be positive real numbers. Prove that 
2(a+b+c+d)(ab+ac+ad+bce+bd+cd) > 3(abc + abd +acd + bcd), 


with equality if and only ifa =b=c=d. 
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The main result of this section relates the number of positive roots of a real 
polynomial to the number of changes of sign along the sequence of its nonvanishing 
coefficients. Among the results proved so far in this chapter, it depends only on the 
Bolzano theorem, so that we could have discussed it right after that. Up to details, 
our presentation follows [26]. 

We start with the following combinatorial lemma, which can be easily established 
by induction, for example. 


Lemma 17.12 For each finite sequence v of + and — signs, let V(v) denote the 
number of pairs of distinct consecutive signs in v. If A stands for the set of those 
sequences with initial and final distinct signs, and B for the set of those sequences 
with initial and final equal signs, then: 


(a) ve AS V(v) = |(mod 2). 
(b) v€ B= V(v) = O(mod 2). 


Now, we shall need the following definition. 


Definition 17.13 Let 
F(X) = ay X" + ay 1X" +--+ aX" + aX" 


be a nonconstant real polynomial, with k, > ky-1 >... > ki > ko = Oanda; #0 
for 0 < j <n. Define the sequence ve = (Qn, M1, ..., 1, @0) by setting, for 
each integer 0 <j <n, 


fos +, if aj >0 
I |=, if aj <0° 


The variation of f, denoted V(f), is the number of pairs of distinct consecutive 
signs in vf. 
For our purposes, the following one is a crucial result. 


Lemma 17.14 Let g be a nonconstant real polynomial and c > 0 be a given real 
number. If f (X) = (X — c)g(X), then V(f) — V(g) is an odd positive integer. 


Proof We make induction on dg > 1. 


430 


(a) 


(b) 
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dg = 1: we can surely suppose that g is monic. If g(X) = X, then f(X) = 
X? — cX, so that V(f) — V(g) = 1. Now, assume that g(X) = X — a, with 
a > 0. Then, f(X) = X*—(a@+c)X +ca, so that V(f) — V(g) = 2—-1=1. 
The case g(X) = X + a, with a > 0, is equally easy: f(X) = X* + (a — 
c)X — ca and, since —ca < 0, Lemma 17.12 assures that V(f) is odd; hence, 
V(f) — V(g) = V(J/), an odd positive integer. 

Assume that the result is valid for every nonconstant real polynomial g such that 
dg <n, withn > | being an integer, and take another such polynomial g, this 
time of degree n. For the sake of notation, for a given nonconstant real polyno- 
mial h, whenever necessary we shall denote by a, and Bp, respectively, the lead- 
ing coefficient and the coefficient of the term of lowest degree in h. In particular, 


h(X) = ayX" + +--+ BnX*, 
with r > s. We distinguish three cases: 


(i) All coefficients of g are positive: once again assume (without loss of 
generality) that g monic, say g(X) = X” +---+ 8X’, with/ <n. Then, 


FiDe oe a ee x, 


with cB < 0, and Lemma 17.12 shows that V(f) is odd and, hence 
V(f) — V(g) = V(J/) is an odd positive integer. 
(ii) There exist nonconstant polynomials u and v such that g =u + v, 


u(X) = ay X" +--+ + By X*, and v(X) = ayX? +---+ ByX4, 
withhn =r >s, p >qandp+1 <-s. Then, 
(X — c)g(X) = (X — c)u(X) + (X — c)u(X) 
= (a, Xt! 4+... — eB, X) (17.6) 
4 (Gy XP ae = op RD, 


since p + 1 < s. Let us now look at two distinct subcases: a, 6, > 0 and 
AyBy < 0. 


* dyB, > 0: we obviously have V(g) = V(u) + V(v). However, since 
(—cBy)ay < 0, the second equality in (17.6) shows that 


V(f) = V(X —c)g) = VOX —c)u) + V(X —c)v) +1. 


By the induction hypothesis (in principle, wu has degree r = n; 
nevertheless, s > 1 implies that we can apply the induction hypothesis 
to w(X) = a, X'~* +--+ + By, since u(X) = X*w(X)), we obtain 


V20+VQ)+0+V@) +1eVe)+1 
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and, modulo 2, 
Vif) =d+V@))+ 04+ Vo) +1= V(g) +1. 
* aby <0: since a6, <0, we have 
V(g) = V(u) + V(v) +1 
and, by (17.6), 
V(f) = V(X — c)g) = V(X — c)u) + V(X — cv). 

Once again from the induction hypothesis, we get 

V(f) = U+V@))+ 1+ Vv)) = V(g) +1 
and, modulo 2, 

Vifj=d+Vm@)+0+V@))= Ve) +1. 

(iii) 2(X) = ag X” + ape XO oes bee X*,, with ap... +, ae ~ D and not 


all of dn, Qn—1, ..., Ak positive. Assume, without loss of generality, that 
a, > O, and write 


g=so-git::-+(-l'g, 


such that each gj = ay foes Bj X" (ki; > 1) has only positive 
coefficients and /; = kj4; + 1 fori < t. Then, V(g) = ¢ and 


(X —c)g = (X—c)go— (KX -o)git--- +) (X - os: 
= (X —c)(apX™ +--+ BoX") 
— (X =e) X" 4 ..+4 pyX) 4... 
+ (=1)'(X = c)(aX" +--+ + BX") 
= apX't! 4... — (ay + cho) X +++» + (az + cB1)X"! 
pee (013 + CBa)X? + +++ (=I) (oy + By) X" 
tee (-1)t 1 cB, X", 


Since a, a, + cBho, a2 + chi, ..., & + cB;-1 and cf, are all positive, 
Lemma 17.12 assures that there is an odd number of changes of sign (and, 
hence, at least one) of consecutive pairs of coefficients in each of the t + 1 
dotted intervals above. This gives 


Vf) =V(X—c)g) =t+1=Ve) +1. 


432 17 Polynomials Over R 


It also follows from the above that, modulo 2, 


Vif) =V(X—c)g) =t+1=V(g) +1. 
Oo 


We can finally state and prove the following result, which is known in Mathe- 
matics as Descartes’ rule’ for polynomials with real coefficients. 


Theorem 17.15 (Descartes) Let f be a nonconstant real polynomial. If P(f) 
denotes the number of positive roots of f, then V(f) — P(f) is a nonnegative 
even integer. 


Proof We make induction on the degree of /. 

If df = 1, assume, without loss of generality, that f is monic. If f(X) = X or 
f(X) = X +a, witha > 0, then V(f) — P(f) =0-O0=0. If f(X) = X -a, 
with a > 0, then V(f) — P(f) =1-—1=0. 

Now, let n > 1 be a given integer and suppose that the theorem holds true for 
each nonconstant real polynomial of degree less than n. If 


F(X) = ap X" + ay X" | +--+ +.a1X tag 


is a real polynomial of degree, we have two different cases: 


e P(f) = 0: in this case, it suffices to show that V(f) is even. Since f(0) = ao 
and f(x) has the same sign as a, for every sufficiently large real number x (for 
limy— +00 fm = d,), the fact that P(f) = 0 assures, via Bolzano’s theorem, 
that a,a9 > 0. Hence, Lemma 17.12 guarantees that V(f) is even. 

¢ P(f) > 0: by taking a positive root c of f, there exists a nonconstant polynomial 
g such that f(X) = (X — c)g(X). Lemma 17.14 guarantees the existence of an 


odd positive integer J for which 


Vif) Vier d= Pee 
= (Ve) = Pa) au =I). 


Since, by induction hypothesis, V(g) — P(g) is nonnegative and even, we 
conclude that the same holds for V(f) — P(f). oO 


?René Descartes, French mathematician, philosopher and scientist of the seventeenth century. 
Descartes’ legacy to Mathematics and science is a huge one, and came mainly from his 
landmarking book Discours de la Méthode (Discourse on the Method) and its three corresponding 
appendices. This book marks a turning point on the way of doing science, for, along it, Descartes 
strongly rejected the scholastic tradition of using speculation, instead of deduction, as the central 
strategy for the investigation of natural phenomena. On the other hand, its appendix La Géométrie 
layed down the foundations of Analytic Geometry, and nowadays every student is acquainted with 
cartesian coordinate systems. 
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The coming corollary examines the case of negative roots. 


Corollary 17.16 If f is a nonconstant real polynomial and N(f) is the number of 
its negative roots, then V(f(—X)) — N(f) is nonnegative and even. 


Proof Immediate from Descartes’ rule, together with the fact that a « N(f) if and 
only if —a@ € P(f). Oo 


As a direct consequence of Descartes’ rule, if f is a nonconstant real polynomial 
such that V(f) is odd, then has at least one positive root. More particularly, if 
V(f) = 1, then f has exactly one positive root. The coming example takes 
advantage of this fact. 


Example 17.17 Let ao, a), ..., G,—1 be nonnegative real numbers, at least one of 
which being nonzero. Compute the number of positive roots of the real polynomial 


x” Gx St eS a, X — ag. 


Solution Denoting by f the given polynomial, Theorem 17.15 assures that V(f) — 
P(f/) is nonnegative and even. However, since V(f) = 1, it is clear that P(f) = 1. 
oO 


Problems: Sect. 17.3 


1. Let g € R[X] \ R be a polynomial of degree n and f(X) = g(X”). If f has 
exactly n positive roots and n negative roots, show that V(f) =n. 

2. For each n € N, show that the real polynomial X" — X"~! — x"-?—...— KX -1 
has a positive root a, satisfying 


1 
<a, 5 2-— 


2 : 
Qn 


~ gn-1 


3. (Leningrad) The third degree real polynomial aX? + bX? + cX +d has three 
distinct real roots. Compute the number of real roots of the polynomial 


A(aX? + bX? + cX +.d)(aX +b) — BaX? + 2bX +c)’. 


Chapter 18 m®) 
Interpolation of Polynomials spooks 


Corollary 15.10 assures that there is at most one polynomial of degree n and 
assuming preassigned values in n + 1 given complex numbers. What we still do not 
know is whether such a polynomial actually exists. For instance, does there exists a 
polynomial f with rational coefficients, degree 3 and such that f(0) = 1, f(1) = 2, 
f (2) = 3 and f (3) = 0? In this chapter we study a bunch of techniques that allow us 
to answer this and alike questions, and which are generically referred to as interpola- 
tion of polynomials. In particular, we shall study in detail the class of Lagrange inter- 
polating polynomials, which will then be used to solve Vandermonde’ linear systems 
with no Linear Algebra. In turn, the knowledge of the solutions of such linear 
systems will allow us to study, in Sect.21.1, an important particular class of linear 
recurrence relations, thus partially extending the methods of Section 3.2 of [8]. 


18.1 Bases for Polynomials 


For what follows, recall that K stands for any of the number sets Q, R or C. We 
shall firstly need to establish a quite useful notation. 


Definition 18.1 For € N and 1 <i, j <n, we let the Kronecker delta! 4; j be 
defined by 


1, ifi=j 
bij = pacts ie 
0, if i #j 
The advantage of Kronecker’s notation comes from the fact that we can use it as 
a tag, in the following sense: given n € N and a sequence (bj,..., b,) in K, we 
have 


‘Leopold Kronecker, German mathematician of the nineteenth century. 
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n 
S > bijbj = bi (18.1) 
j=1 


forl <i <n. 
Now, let be given n € N and pairwise distinct elements a1, a2, ..., dy in K. For 
1 <i <n, we define L; € K[X] by 


L(x) = T] X aj _ X-a, X — 4G; X — An 
: aj — aj aj — aj = aj — aj _ aj — An , 


l<j<n 


J#i 


with the hat —_ over a certain factor meaning that it is actually missing from the 
product. Such polynomials L; are called the Lagrange interpolating polynomials 
for the set {a,,..., dy}. 

It is immediate to verify that 


Li (aj) = bij 


for all 1 < i, 7 <n. In turn, as we shall see in the coming result, this property 
allows us to build polynomials attaining preassigned values at a set of pairwise 
distinct given elements of K. Such a result is usually referred to as the Lagrange 
interpolating theorem. 


Theorem 18.2 (Lagrange) Given n € N and qj,...,an,b1,...,bn € K, with 
a,..., a, being pairwise distinct, there exists exactly one polynomial f € K[X], 
of degree less than n and such that f (aj) = bj for 1 < i < n. More precisely, such 
a polynomial is 


f(X) = bjLj®, (18.2) 
j=l 


with the L; standing for the Lagrange interpolating polynomials for the set 
{a1, aston an}. 
Proof Uniqueness follows from Corollary 15.10. On the other hand, taking f as 
in (18.2), it suffices to show that df <n and f(a;) = bj forl <i <n. 

Since OL; = n—1 for 1 < j <n and the degree of a sum of polynomials, 
whenever defined, does not surpass the greatest degree of the summands, it is pretty 
clear that df <n. For what is left to do, just note that 


fGo= >) biga= > b8,=h), 


j=l j=l 


where we have used (18.1) in the last equality above. oO 
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The following corollary provides an equivalent way of formulating the previous 
result. 


Corollary 18.3 Letn € N and aj, ..., dy be pairwise distinct elements of KK. If 
Ff € K[X] \ {0} satisfies of <n, then 


F(X) = Yo f@)Li(X). 


i=1 


Proof Indeed, by letting 


g(X) = > f@)Li(X), 


i=1 
we have df, dg < n and, by the proof of Lagrange’s theorem, g(a;) = f(a;) for 
1 <i <n. Therefore, by invoking Corollary 15.10 once more, we getg = f. O 


We now collect some interesting applications of Lagrange’s interpolation, begin- 
ning with the operational aspects involved. 
Example 18.4 Find a polynomial f € Q[X] such that f(1) = 12, f(2) = 2, 
f@G) =1, f4@) = —6and f(5) = 4. 
Solution Firstly, let us explicitly write Lagrange’s interpolating polynomials for the 
set {1, 2, 3, 4, 5}: 


peep f 22 Pe FH eS 
cea eae al Geary) Cac) (ces) ces 


i#l 


i 
= vice = 14x? 4 91 X* = 154 + 120); 


1 YEVEDED 


j#2 


Lo(X) 


1 4 S 2 
= ~2(X4 — 13X° + 59X? — 107X + 60), 


OO I) a=; -() 45) G5) 3) 


J#3 


1 4 3 2 
= 7 (X*— 12x? + 49X? — 78X + 40), 
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_ py xin (XE!) (%22) (%23) (45 
BOS We Ga) aa) aea) ae) 


J#4 


1 4 3 2 
—g(X* — 11X? + 41X? — 61X +30) 


7 Xrd a (Xa!) (X22) (¥=3) (¥=4 
Oe cen (cea esr ea 


JAS 


and 


1 4 3 2 
5g (X" — 10X? + 35X? — 50X + 24). 


Hence, according to previous corollary we can take 


f(X) = 12L1(X) + 2L2(X) + L3(X) — 6L4(X) + 4L5(X) 
19:94. 55.24 233-4 2975 
SX x Xx X +86. 
iD a ee 


oO 


Example 18.5 (IMO Shortlist) Let f be a monic polynomial of real coefficients and 
degree n, and let x), x2,..., Xn+1 be pairwise distinct integers. Show that there exists 
an index 1 < k <n +1 for which | f(xx)| = a 


Proof Lagrange’s interpolation formula gives 


n+1 


f@) =) ¢@)[]——_. 
j=l 


nie, 
izj / 


Therefore, by comparing the leading coefficients on both sides of the equality above, 
we obtain 


ee. FED 


l= ee 
= Hizj@y — xi) 


Now, if M = max{|f(x,)|; 1< k <n-+ lj and p; = Hizj — x;), we have 


atl n+l nt+1 
ie ye ys ED Jie 
j=l Pj j=l Ip; ei IP; 
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n+l 1 
j=l |pjl ; : 
assume (after reordering the x;’s, if needed) that x} < x2 <--- < X,41. This way, 


However, since }~ is symmetric with respect to x1, x2,..., X,»+1, we can 


[pil = ej — x1)... yj — Xj) 4 — Xj) --- Ong — x7) 


>[VG-1)...2-1][1-2...@+1-—j)] 
n! 
= G- DIN +1- I= Te 


Cy 


In view of the estimates above, we finally get 


n+l n+l n 
remy SM als aor 
am IPjl ah j-1 n! 
or, which is the same, M > a oO 


We now show how to use Lagrange’s interpolating polynomials to solve certain 
types of linear systems of equations, known as Vandermonde systems. Apart from 
Example 18.7, Vandermonde systems will also play a role in the discussion of some 
special linear recurrence relations, in Sect. 21.1. 


Proposition 18.6 (Vandermonde) Given elements a, a2, ..., Gn, @1, 2, ..., An 
of KK, with ay, a2, ..., dn being pairwise distinct, the linear system of equations 

X1 X24 +++ + Xn = a} 

AX] + A2X2 1 ++ + AnXn = a2 

arx| + asXx2 fore Gdn =a (18.3) 


ie + ay~'xp +++ tay, = On 


admits a unique solution in K. In particular, if a, = +--+ = @p, = 0, thenx, =--- = 
Xn = 0. 


Proof If f(X) = co taX +--+ + Cn_1X"—! © K[X], then, by respectively 
multiplying the equations of the linear system by co, c1, ..., Cn—1 and adding the 
results, we obtain 


f(ay)xy +--+ + fan) Xn = cgay + cyag + +++ + Cy-1Mp. 


Now, if we let f = L;, where L; is the i-th Lagrange interpolating polynomial 
for {a,, a2, ..., a}, then the left hand side of the equality above reduces to x;, so 
that 


Xj = cod) +c1Q2 +---+cCp_-1dy € K. (18.4) 


oO 
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The coming example shows how to use the previous proposition to give an 
alternative proof for the generalized multisection formula (cf. Problem 8, page 355). 


Example 18.7 Let f € C[X] \ {0}, 1 > 1 be a natural number and0 <r <n-1 


be an integer. If w = cis on and z € C, show that 


n-1 


- a,zk - “yo flols), 
j=0 


k=r (modn) 


with a; denoting the coefficient of X* in f. 


Proof Writing 


f(X) = uo(X") + Xuy(X") + +X" ty 1 (¥"), 


with uo, U1, ..., Un—1 € CX], and taking into account that w” = 1, we obtain 
ug(z") + zuy(z") + + + 2" n_1(2") = f(z) 
ug(z") + wzuy(z") + +++ + (wz) uy_-1(2") = f(@z) 
uo(2") + (w°z)ui(2") ++ + (@2z)"Un-1") = f (wz) 


wo(2") +o"! z)ui (2) + + (12) Nn") = fo" 12) 


This is a Vandermonde system in uo(z"), zu1(z"), ...5 27! uy (z"), with 
fue > aa (18.5) 
k=r (mod n) 


According to (18.4), we have 
zur (2") = cof (z) ter f (wz) ++++ + cri fo" |2), (18.6) 


where cj; stands for the coefficient of X/ in the r-th Lagrange interpolating 
polynomial L, relative to {1, w, w,...,@"—!|}. However, 


(X=-1...%—o)... X08 


of =n —o\. ntet<—ely 


L;(X) = (18.7) 


with numerator equal to 


es = : = ai cas = yn-l + wo! X"-2 ueeee a ot x oe othr, 
—W —@OW 
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On the other hand, item (a) of Problem 5, page 393, computes the denominator 
of (18.7) as being equal to nw—)", so that 


(m=—1—j)r 1 : 
cj= Oy =-a@!". 
, na—Dr n 
It finally follows from (18.5), (18.6) and the above expression for c; that 


n—-1 n—-1 


dah = hej flo! = ~ yo" fos) 
j=0 


k=r (modn) j=0 
Oo 


A limitation of Lagrange interpolation lies in the fact that it only generates the 
set of nonzero polynomials having degrees less than or equal to some fixed natural 
number. We remedy such a situation with the following more general definition. 


Definition 18.8 A basis for K[X] is a sequence (fo, fi, f2,...) of elements of 
K[X] satisfying the following condition: for every f € K[X], there exist unique 
n € Zi andao,..., dn € K such that 


F(X) = ao fo(X) + +++ + an fn(X). 
Example 18.9 A simple way of constructing a basis for K[X’] is to take a sequence 
(fo, fi, fo, -.-) in K[X] such that df; = j for every j > 0. 


In order to show that such a sequence is indeed a basis for K[X], we have to 
establish the following two claims: 


(i) Ifn € N and ao, a,..., ay, and bo, bj, ..., by are elements of K such that 


ao fo(X) + a1 fi(X) ++ ++ +n fn(X) = bo fo(X) + 1 fi) +++ + bn fn(X), 


then a; = b; forO <i <n. 
(ii) For every f € K[X], there exist n € N and ao, aj, ...,d, € K such that 


f(X) = ao fo(X) + a1 fi(X) +--+ + an fr(X). 


We leave these checkings for the reader (cf. Problem 1). 


As a particular case of the previous example, note that the sequence 
Ce NP) 
is a basis for K[X] (as we already knew). 


In what follows, inspired by the binomial numbers, we construct a quite useful 
basis for K[X]. 
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Definition 18.10 For k ¢ Z,, we define the k-th binomial polynomial ({) by 
setting (a) = 1, G) = X and, for an integer k > 1, 

= a2 Re 1)...(% -—k+1) 

7 ig : 

Since a(*) = k, para todo k € Z4, a direct application of Example 18.9 
guarantees that the sequence of binomial polynomials is a basis for K[X], called 
the binomial basis. 

In spite of this, for future use we shall give a direct proof of the validity of 


condition (ii) of that example. More precisely, let us make induction onn > 0 
to show that there exist ag, a), ..., 4, € K for which 


x x x 
x" =a0( 5) +ai(7) +o an(?). (18.8) 
0 I n 


For n = 0 andn = 1, (18.8) holds due to the very definition of G) and =) Now, 


assume that for some k € N there exist ao, aj,..., ax € K satisfying (18.8) when 
n = k. Then, 
x x x k+1 
ao X+a, X+-++++ag X=X (18.9) 
0 1 k 
with 
xX 1 : rae 
; X= TX(KX-1)... K-F+DX-F+H) 
J J: 
1 . : 
= Fi cei eee lee an 


+ EX(K 1). (KF +) 


=U +0/ ~ )+iG) 
= fey ay 


Therefore, substituting this expression for (7)x (for 0 < j < k) into (18.9), we 
conclude that (18.8) is valid forn =k + 1. 

Even though the concept of basis for K[X], as formulated above, does not apply 
directly to polynomials over Z, when dealing with the binomial basis we have the 
following result. 


Proposition 18.11 Jf f € C[X] \ {0} is a polynomial of degree n such that f (0), 
f(),..., f(™) are all integers, then there exist unique ao, a1, ..., An € Z such that 


XxX xX xX 
FX) = a0(G) ta(P) +s +an(2). (18.10) 
0 1 n 


In particular, f (x) € Z for every x € Z. 
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Proof Since the binomial polynomials for a basis for C[X], there exist unique 
ao, 41,---,4, € C such that f can be written as in the statement of the proposition. 
Thus, it suffices to prove that ao, a), ..., Gn are integers. 

Arguing once more by induction, start by noticing that Z 5 f(0) = ag. Now, 
assume that we have already proved that ap,...,az—1 € Z, for some integer 1 < 
k <n. Then, (18.10) gives 


k-1 xX n xX 
£0) -Ya(FJ@=Da(*)w. (18.11) 


j=0 j=k 


For 0 < j < k, the evaluation (*) (k) coincides with the ordinary binomial number 


©) On the other hand, for k + 1 < j <n, we have 
xX 1 : 
(w= qkk-D..k- J+) =0 


Therefore, (18.11) reduces to 


k-1 He 
r)-Yra(1) = ar, (18.12) 


j=0 
and the fact that ag, a1, ..., ar—1 € Z, together with f(k) € Z, gives ag € Z. 
For the last part, since C) (x) € Z for every x € Z (cf. Problem 2), it follows 
from (18.10) that f(x) € Z for every x € Z. oO 


Also with respect to the proof of the proposition above, (18.12) can be written as 


k 


ponE (0 


j=0 
Therefore, a direct application of Lemma 2.12 furnishes 


k 
(k 
ax =) (- 1) (S\rw eZ 


j=0 


for 0 < k <n and with no inductive argument. 


Problems: Sect. 18.1 


1. * Complete the discussion of Example 18.9. 
2. * Given k € Z, prove that (i) (x) € Z for every x € Z. 
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3. (USA)* Let f be a polynomial of degree n, such that f(k) = ey for 0 < 
k <n. Compute all possible values of f(m + 1) in terms of n. 

4, (Singapore)’ We are given a polynomial f of degree n, such that f(k) = eal 
for every integer 0 < k < n. Compute all possible values of f(n + 1) in terms 
of n. 

5. Given an n-sided simple polygon P, show that there exists a polynomial f € 
R[X], of degree at most n — | and such that the graph of f contains the set of 
vertices of P. 

6. Given pairwise distinct real numbers a, b, c and d, solve the linear system of 
equations 


ax, t+a*x> + a°x3 +atx, = 1 
bx, + b?x2 + b3x3 4+ b4x4 = 1 
cxy + c2x2 + x3 +c4x4 =17 
dx; +. d*x.+ x3 +d*x4=1 


7. (USA) Let n > 3 be an integer and p, po, P1,-.-, Pn—2 be polynomials over R 
for which 


n—2 
S > X/ p(X") = (X™ 14 X" 7 4 + X + 1) p(X). 
j=0 


Prove that X — | divides p;(X), forO <i <n—2. 
8. Let n > 1 be a given natural number. 


(a) Factorise the polynomial 
n 
-{(n —— 
f(M= Dev ‘xox LD) cc KX + fle (Xn) — al. 
j=0 ? 
(b) Conclude that for each k € N one has 


eo) 
j=0 GJk+g  kk+1)...ce+n) 


(c) Compute, in terms of n, the sum of the series 


1 
) FEED. aE 


k>1 


>For another approach to this problem, see Problem 4, page 450. 
3For other approaches to this problem, see Problem 11, page 386, and Problem 3, page 450. 
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9. (Leningrad) A finite sequence a1, a2, ..., dy, is said to be p-balanced if, for k = 
1,2,..., p, all sums of the form 


ak + Aktp + Ak42p +> 


are equal. Prove that ifn = 50 and the sequence (a;)1<j<so is p-balanced for 
p = 3,5, 7, 11, 13 and 17, then all of its terms are equal to 0. 


18.2 Finite Differences 


Another useful, though a little more elaborate, interpolation technique is that of 
finite differences. We shall sketch the rudiments of it along this section. 


Definition 18.12 Let h be a real number and f : R — R be a given function. 
For an integer k > 0, we define the k-th finite difference of f with step h as the 
function At f:R-R, given by: 


(a) ARf =f. 
(b) (Aj f(x) = (Anf)(@) = fe +h) — f() for each x € R. 
(c) At f = A,(At' f) fork > 2. 


For such a definition to be useful, we need the properties of At f collected in the 
coming result. 


Proposition 18.13 Jn the notations of the previous definition, given h € R and 
functions f, g : R = R, we have that: 

(a) If f is constant, then Ay f = 0. 

(b) Ifa and b are real constants, then Ay;(af + bg) = aAnf + bAng. 


(c) An( fg) = (An f)(g + Ang) + fAng. 
(d) AK f = Akl (A, f), for every k EN. 
(e) Ifk > Oand x ER, then 


k 
(k 
(A, AG) = OD! ({)r0 + (k = j)h). 
j=0 


Proof 


(a) For x € R, we have A; f(x) = f(x +h) — f(x) =O, since f is constant. 
(b) For x € R, we can write 


(An(af + bg))(x) = (af + bg) +h) — (af + bg)(x) 
= a(f(x +h) — f(x) + b(g@ +h) — g@)) 
= a(An f)(x) + b(Ang)(@). 
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(c) Exercise (see Problem 1). 

(d) Let us make induction on k > 1, the case k = 1 following directly from 
Definition 18.12. By induction hypothesis, assume that the result is true when 
k =1> 1. Fork =1-+ 1, successively applying item (c) of Definition 18.12, 
the induction hypothesis and once more item (c) of Definition 18.12 (but this 
time to A, f in place of f), we obtain 


Alt f = An(Al f) = An(AL | (Anf)) = Ah (Anf). 


(e) Let us make induction on k > 0, noticing that if k = 0 and x € R, then 


re vi £0 =(- »°(4 ) Fo) = f(x) = (ARNG). 


Now, assume that the formula holds when k = / > 0, and let us prove its 
validity for k = 1 + 1. Given x € R, write A = (A!t! f)(x). By successively 
applying item (c) of Definition 18.12, the induction hypothesis and Stifel’s 
relation, we obtain 


A = Ag(AL f)@) = (AL AG +h) — (ALA 


l 


: (i 
= 2 -vi(\s wth +d= pm - Pew (fo +0- pm 


j=0 


=fixtd+ DHE 1) ie )-G ' .)) fat@+1-— sh) 


j=l 


“De vi(’ fob pm 


I 
=2 4 -(% ') fe 1— j)h) 


l l 


. l (1 
-)\(-1)/ i. - |) fore - im -Y-v'(4) Fo + (I- jh), 


j=l j=0 
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Hence, 
i+41 Tia 
al pay = cw feta t ip Cnt fey 
j=0 
1-1 (I I (Id 
*Devi(A)fo+0-pH-ev!(F)ro+d-pm 
j=0 j=0 
i+] 
=e re +a+1- pm 
j=0 
I fi 1 /I 
se i()rete-am Dew )roto-pm 
j=0 j=0 : 
Il 
= Dewi") r04 04 1- pm. 
j=0 
Oo 


Among the properties of finite differences listed in the previous proposition, the 
one which is more frequently used in interpolation problems is that of item (e), 
mostly when combined with the coming simple result. Note that the proposition 
above refers to finite differences of general real functions, whereas the next one is 
specifically related to finite differences of polynomials. 

We start by extending Definition 18.12 to polynomials, in the obvious way: for 
fixed h € R and f € R[X], given an integer k > O we define the k-th finite 
difference of f with step h as the polynomial AE f € R[X] given by: 


G@) AR =f. 
(i) (A, ACO = (An f)(X) = f(X +h) — F(X. 
(ii) Ak f = An(A‘! f), for k > 2. 


In view of the above definition, it is now immediate to check that items from (a) to 
(d) of the previous proposition still hold for finite differences of polynomials. 


Proposition 18.14 Let f © R[X] be a polynomial of degree n > 1. For an 
arbitrary fixed step h € R \ {0}, the n-th finite differences of f all equal, whereas 
the l-th ones, for any | > n, vanish identically. 


Proof Start by observing that, if f(X) = a,X" +---+a,X + ao, then 
(An f)(X) = an(X +h)" +--+. +a1(X +h) + a9 
— (anX" + +++ +a,X +9) 
= na,hX" 14 2(X), 
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with either g = 0 or 0g < n — 2. Since h ¥ 0, we conclude that A; f has degree 
n — |. Therefore, arguing by induction on the degree of a polynomial, item (d) of 
the (extension, for polynomials, of the) previous proposition guarantees that A) f = 


De Kg J) has degree 0, so that it is constant. Hence, A! f(X) =Ofor)]>k. O 


As will be clear from the two coming examples, the main usefulness of the 
material on finite differences discussed above lies on the fact that, if the interpolation 
points form an arithmetic progression, then the method of finite differences allow 
us to compute the value of the corresponding polynomial at any point of such a 
progression. 


Example 18.15 Let f € R[X] be a polynomial of degree m > 1, such that f(j) = 
r/ for 0 < j < m, with r being a given positive real number. Compute all possible 
values of f(m+ 1). 


Solution Set h = 1 and write A‘ f to denote At f (either the polynomial or its 
corresponding polynomial function). Item (e) of Proposition 18.13 gives 


k Stk 
A f(x) = Devi) 6 +k—j) 


j=0 


for every x € R. However, since df = m, the previous proposition then gives 


m+1 
(=A FO = yew" ') fom en 7). 
j=0 


Therefore, 
m+1 


fim+) = ewn("* ') fem pie 7) 


j=l 


m+1 
= Denn(" i ae 
J 


j=l 


m+1 
a perl yo(-p/ (” = ae 
J 


j=0 


- pnt _ (r 7 i, 


where we used the binomial expansion of (r — 1)’"*! in the last equality above. O 


Example 18.16 Let f be a polynomial of degree 1992, such that f(j) = 2/ for 
1 <i < 1993. Compute the remainder upon division of f (1994) by 1994. 
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Solution Combining item (e) of Proposition 18.13 with Proposition 18.14, and 
writing again A‘ f to denote Me ft, we have 


1993 
0 = Fy = wi) fas94 - a, 


j=0 


Hence, 
1993 


f(1994) = eae Oe ee 


j=! 


1993 


1993) . 1994 () 1993—j 
= 21994 _ 9 S\(—1)/ 21993—j 
( 0 ) ae ae 


j=0 
= 21994 _ 99 — 1)1993 = 91994 _ 9 
For what is left to do, note first that 997 is prime (this can be easily checked with 
the aid of Eratosthenes sieve, exactly as we did in Example 6.37 to show that 641 is 


prime). Hence, Fermat’s Little Theorem 10.15 assures that 2996 = | (mod 997), so 
that 


21992 _ (2996)? = 1 (mod 997). 
From this, it is immediate to get 


(1994) = 2" = 9 = 9? = 2 = 20d 997). 


We are then left with the system of linear congruences 


ce = 0 (mod 2) 
f (1994) = 2 (mod 997) ” 


which has, by the Chinese Remainder Theorem 10.27, a single solution modulus 
2-997 = 1994. However, since 2 is clearly a solution, we obtain 


f (1994) = 2 (mod 1994). . 


Problems: Sect. 18.2 


1. * Complete the proof of Proposition 18.13. 
2. Prove that, for each real function f : R > R, we have 


k 


k > 
fath=>- (‘Joos F(x), 


j=0 


for all h, x € R and every integer k > 0. 
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ake 


3. (Singapore)* Let f be a polynomial of degree n, such that f(k) = Fal 


integer 0 < k <n. Compute f(n + 1). 

4. (USA)? Let f be a real polynomial of degree n, such that f(k) = Gay for 
0 < k <n. Compute all possible values of f(n + 1). 

5. (MO—shortlist) A polynomial f, of degree 990, is such that f(kK) = Fy for 
992 < k < 1982, where Fy stands for the k-th Fibonacci number. Prove that 
f (1983) = F933 — 1. 

6. (MO—shortlist) Let f be a real polynomial such that 
(a) fl) > FO) > 0. 
(b) f(2) > 2fC) — FO). 
(c) FB) > 3f@2) —3fW) + FO). 
(d) fm~+4) >4f(4+3)-—6f(1+2)+4f(a+ 1) — f(M™) for every n € N. 


Prove that f(x) > 0 for everyn EN. 


for every 


4For other approaches to this problem, see Problem 11, page 386, and Problem 4, page 444. 
>For another approach to this problem, see Problem 3, page 444. 


Chapter 19 m®) 
On the Factorisation of Polynomials cen 


The division algorithm for polynomials provides a notion of divisibility in K[X] 
when K = Q, R ou C, and such a notion enjoys properties analogous to those of 
the corresponding concept in Z. It is then natural to ask whether there exists some 
notion of primality in K[X], which furnishes some sort of unique factorisation with 
properties similar to the unique factorisation of integers. Our purpose in this chapter 
is to give precise answers to these questions, which shall encompass polynomials 
with coefficients in Z,, for some prime integer p. 


19.1 Unique Factorisation in Q[X] 


Along this section, K denotes any of Q, R or C. 
We say that f, g € K[X]\ {0} are associated (in K[X]) if there exists a € K \ {0} 
such that f = ag. For instance, the real polynomials 


1 
FOG Sak? Ok 1 and eC S90 ak 
J2 
are associated in R[X], for f = /2g and /2 € R \ {0}. 
Given f, g € K[X i {0}, we say that a polynomial p € K[X] \ {0} is acommon 


divisor of f and g if p | f, g. Note that f and g always have common divisors: the 
constant nonzero polynomials over K, for example. 


Definition 19.1 Given f, g € K[X] \ {0}, we say that d € K[X] \ {0} is a greatest 
common divisor of f and g, and denote d = gcd(f, g), if the two following 
conditions are satisfied: 


(a) d| f,g in K[X]. 
(b) If d’ € K[X] \ {0} divides f and g in K[X], then d’ | d in K[X]. 
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The coming result is the analogue, for polynomials, of Theorem 6.13. For this 
reason, it is also known as Bézout’s theorem. In order to properly state it, given 
f € KLX] we let f K[X], or simply (f), if there is no danger of confusion, denote 
the subset of K[X] formed by the multiples of f, i.e., 


(f) = f KIX] = {af; a € K[X}}. 
Theorem 19.2 (Bézout) Let f, g € K[X] \ {O}. /f 
S = {af + bg; a,b € K[X}}, 


then there exists a polynomial d € K[X]\ {0} satisfying the following conditions: 


(a) S = (d). In particular, d | f, g in K[X]. 
(b) Every polynomial in K[X] \ {0} that divides f and g also divides d. 


Moreover, such a polynomial d is unique, up to association. 
Proof 
(a) Let d € S \ {0} be such that 

dd = min{dh; h € S \ {O}}. 


We first claim that S = (d). Indeed, letting d = ao f + bog, with ao, bop € K[X], 
and c € K[X], we have 


cd = (cag) f + (cbo)g € S, 


ie., (d) C S. Conversely, take h € S, say h = af + bg, witha, b € K[X]. By 
the division algorithm, we have h = dq +r, with g,r € K[X] andr = 0 or 
0 < or < dd. However, if r 4 0, then dr < dd and 


r =h-—dq 
= (af + bg) — (aof + bog) 
= (a—aoq) f + (6 — bog)g € S, 


a contradiction to the minimality of the degree of d in S. Therefore, r = 0 and, 
hence, h = dq € (d). 

For the second part of item (a), it suffices to see that f, g € S = (d), so that, 
in particular, f and g are multiples of d in K[X]. 
Let d’ € K[X] \ {0} be a polynomial dividing f and g in K[X], say f = d' fy 
and g = d’g, with f), g; € K[X]. Ifa, b € K[X], then 


(b 


wm 


af + bg = (afi + bgi)d' € d’ K[X]. 
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However, since af + bg is a generic element of S, it follows that 
de (d)=SC(d); 


in particular, d is a multiple of d’, as we wished to show. 
The last claim is left as an exercise for the reader (cf. Problem 1). oO 


In the notations of the statement of Bézout’s theorem, given f, g € K[X] \ {0} 
and letting d € K[X] \ {0} be such that d K[X] = S, from now on we say that d is 
a greatest common divisor of f and g, and write 


d = gcd(f, g). 


Although a gcd of two nonzero polynomials is not unique, thanks to the last 
part of Bézout’s theorem any two gcd’s of them are associated. Thus, whenever 
convenient we can take d to be monic, and we shall adopt such a usage from now 
on without further notice. 

Bézout’s theorem assures that if d = gcd(f, g), then there exists a,b € K[X] 
such that 


d=af+bg. 


Nevertheless, the proof of that result gave no clue on how to effectively compute 
such polynomials a and b. This issue will be dealt with in Problem 3, when we 
show that Euclid’s algorithm remains true for polynomials. 

Keeping with the parallelism to the notion of gcd in Z, we say that polynomials 
Sf, g € K[X]\ {0} are relatively prime if gcd(f, g) = 1. We then have the following 
important consequence of Bézout’s theorem. 


Corollary 19.3 If f, g € K[X] \ {0}, then f and g are relatively prime if and only 
if there exist polynomials a, b € KK[X] for which af + bg = 1. 


Proof If gcd(f, g) = 1, then the existence of a, b € K[X] as stated above follows 
directly from Bézout’s theorem. 

Conversely, if d = gcd(f, g) and there exist a, b € K[X] such that af + bg = 1, 
then, again from Bézout’s theorem, we have 1 € S = d K[X], ie., d is a divisor of 
the constant polynomial |. Therefore, d = 1 up to association, so that 1 is a gcd of 
f and g. Oo 


Corollary 19.4 Let f, g € K[X] \ {0} be relatively prime and h € K[X] \ {0} be 
such that dh < 0( fg). Then, there exist a,b € K[X] such that a = O or 0a < 0g, 
b=0or db < of andaf + bg =h. 


Proof By the previous corollary, there exist a}, b; € K[X] such that a; f+big = 1. 
Then, letting az = ajh and bz = byjh, we have a2 f + bog = h. Now, the division 
algorithm gives a2 = gq + a, witha = Oor 0 < da < dg. Thus, 


h=(gqta)f+hg =af + Gf + bag 
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and, letting b = gf + b2, we have h = af + bg, with a = 0 or 0a < dg. Finally, 
since bg = h — af, if b #4 0 we can compute 


ab + dg = A(bg) = A(h — af) < dh < A( fg) = Of +g, 


so that 0b < Of. oO 
The coming definition is absolutely central for the rest of this chapter. 


Definition 19.5 A polynomial p € K[X] \ K is irreducible over K if p cannot 
be written as the product of two nonconstant polynomials in K[X]. A polynomial 
Pp € K[X] \ K that is not irreducible is said to be reducible over K. 


In general, it is useful to ae the irreducibility condition contrapositively. 
This way, a polynomial p € K[X] \ K is irreducible if and only if the following 
condition is satisfied: 


p=gh, with g,h Ee K[X] >geKorheK. (19.1) 


Let us now take a look at two simple examples. 


Example 19.6 It follows immediately from (19.1) that every polynomial p € 
K[X] of degree 1 is irreducible. Indeed, letting p = gh, with g,h € K[X], 
Proposition 14.9 assures that dg + dh = dp = 1 and, hence, dg = 0 or dh = 0, i.e., 
g or h is constant. On the other hand, the Fundamental Theorem of Algebra 15.23 
guarantees that the irreducible polynomials in C[X] are precisely those of degree 1. 


Example 19.7 If p € R[X] is irreducible over R, then dp = 1 or 2. Indeed, 
according to Problem 4, page 386, a complex number z is a root of p if and only if 
so is z; then, let us consider two cases: 


(a) dp > 3 and p has at least one real root, say a: the root test (cf. Proposition 15.3) 
allows us to write p(X) = (X — a)h(X), for some h € R[X] of degree greater 
than or equal to 2, and p is reducible over R. 

(b) dp => 3 and p has two complex nonreal conjugate roots, say z and Z: by invoking 
the root test again, we have p(X) = (X — z)(X — z)h(X), for some h € C[X] 
of degree at least 1. However, if z = a+ bi and g(X) = (X — z)(X — 2), then 


g(X) = (X —a—bi)(X —a+bi) = (X —a)’ +b’ € R[X]. 


Hence, upon dividing p by g, the division algorithm assures that h € R[X]. 
Therefore, p = gh, with g, h € R[X] \ R, and p is reducible over R. 


Thanks to the last example above, from now on we shall restrict our attention 
to the irreducibility of polynomials over Q. In this sense, an argument analogous 
to that of item (a) of the last example (cf. Problem 5) allows us to conclude that if 
DP € Q[X] has degree 2 or 3, then p is irreducible over Q if and only if it does not 
have a rational root. 
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Back to the general concept of irreducible polynomial, note that if p € Q[X]\Q 
is irreducible, then the only divisors of p in Q[X] are the constant polynomials and 
those associated with p. This being said, we have the following important auxiliary 
result, which is the analogue of Lemma 6.41 for polynomials. 


Lemma 19.8 Let p € Q[X]\ Q be irreducible. If f,,..., fk € QLX] \ {0} are such 
that p | fi... fx, then there exists 1 <i <k such that p | fj. 


Proof By induction on k, it suffices to show that if p | fg, with f, g € Q[X] \ {0}, 
then p | f orp | g. If p { f, we first claim that gcd(f, p) = 1; indeed, if d = 
gcd(f, p), then d | p, so that either d € Q or d is associated with p in Q[X]. 
However, if this last possibility holds, it follows from d | f that p | f too, which 
is a contradiction. Therefore, d € Q and, up to association, we can suppose that 
d=1. 

Now, Corollary 19.3 guarantees the existence of polynomials a, b € Q[X] such 
that af + bp = 1, and hence 


a(fg)+ (bg)p = g. 


Since p | (fg), it follows from the equality above and Problem 6 that p | g, as 
wished. Oo 


We shall now establish an analogue of the Fundamental Theorem of Arithmetic 
for polynomials over Q, thus showing that every polynomial f € Q[X] \ Q can be 
uniquely written, up to association, as a product of a finite number of irreducible 
polynomials. 


Theorem 19.9 Every polynomial f € Q(X] \ Q can be written as a product of a 
finite number of irreducible polynomials over Q. Moreover, such a aan 
of f is unique in the following sense: if pi,..., Pk, di,---,q1 © QLX] \ Q are 
irreducible and such that p,... pk = qi .--q, thenk =1 and, up to reordering, pj; 
and qj are associated in Q[X]. 


Proof For the existence part, let us make induction on df, the case df = 1 being 
immediate (for we have already seen that, in this case, f is irreducible). As induction 
hypothesis, assume that the result is true for every nonconstant polynomial with 
rational coefficients and degree less than n, and take f € Q[X]\Q such that df =n. 
If f is irreducible, there is nothing left to do. Otherwise, we can write f = gh, with 
g,h € Q[X] \ Q. Thus, dg,dh < n, and the induction hypotheses guarantees 
that each of g and h can be written as a product of a finite number of irreducible 
polynomials with rational coefficients, say g = pj)... pj andh = pj4+1... px. 
Then, f = gh = p|... pj Pj+1--- Pk, a product of a finite number of irreducible 
polynomials over Q. 

For the uniqueness part, let pj... pe = qi... qi, With pi,..., Pk, Gis---5 GU € 
Q[X] \ Q being irreducible over Q. If kK = 1, we have pj = q,...qi, and the 
irreducibility of p; assures that / = 1. Similarly, = 1 > k = 1. Then, suppose 
that k,/ > 1; since px | qi ...q:, Lemma 19.8 guarantees the existence of an index 
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1 < j </ such that px | g;. Assume, without loss of generality, that j = 1. Since q; 
is irreducible over Q and px ¢ Q, the only possibility is that pz and q; be associated 
in Q[X], say px = uqi, for some u € Q \ {0}. Then, 


Piss Peat = Gs Qi-kgia1 = Gy +o G_4s 


with g; = qj for 1 <i <1 —2 and q;_, = uq-1, all irreducible over Q. 

Arguing by induction on max{k, /}, the induction hypothesis gives k — 1 =1-—1 
and, up to reordering, p; associated to q/ for 1 < i <1 — 1. Hence, up to reordering 
p; and q; are also associated over Q, for 1 <i <k. oO 


We can summarize the previous result by saying that one has unique factorisation 
in Q[X]. As within Z, if f € QLX] \ Q is factorised as 


f = Pi--+ Pk, 


with pi,..., pk € QLX] \ Q irreducible, then, by gathering together the irreducible 
factors p; which are equal up to association, we obtain 


fSo; ag. (19.2) 


with g1,...,q € QLX] \ Q irreducible and pairwise non associated, and a, ..., 
a; € N. Expression (19.2) (which is also unique up to association and reordering) 
is the canonical factorisation of f in Q[X], and q,...,q) are the irreducible 
factors of f in Q[X]. 


Problems: Sect. 19.1 


1. * Complete the a of Theorem 19.2. 
2. Let f, g € Q[X] \ Q be polynomials with canonical factorisations 


I 


a a Bi Bi, 
fo) oe ee, and g = pf)... Br po 


-Pr Ty ++-4m > 


with pj,..., Pk, G1,---,q, andr}, ..., 7, being monic, irreducible and pairwise 
non associated, and @;, a, B;, Bi € N. Prove that 


gcd(f, g) = pj’... pi, 


with y; = min{a;, 6;} for 1 <i <k. 
3. * Prove the following version of Euclid’s algorithm for polynomials: given f, g € 
X] \ {0}, with of > dg, the division algorithm in K[X] gives 
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Step 1 :f = gqat+n 0 < dr, < 0g 
Step 2 1g = rigaztr2 0 < Oro < Or) 
Step 3 fy) = 1293 +13 0 < 0r3 < Oro 
Step j DVj-2 = Vj-14j +7; 0 < or; < Orj-1 


Step j+1:rj-1= rjqj4i +0 


As in Z, note that the execution of the algorithm actually stops after a finite 
number of steps. Indeed, since rj, 72,... € K[X] satisfy 0g > dr) > dr2 > 
. = O, there must exist a least index j such that r; # 0. Show that r; = 

gcd(f, g). 

4. Compute ged(X+ — X? — 3X7 4+ 2X +3, X° — X34 4x? — 6X — 12). 

5. * If p € Q(X] is a polynomial of degree 2 or 3, prove that p is irreducible over 
Q if and only if p has no roots in Q. 

6. *If f, g,h € Q[X] \ {0} are such that f | g, h, prove that f divides ag + bh, for 
alla, b € Q[X]. 

7. If f € R[X] \ R has degree greater than 1, prove that f can be written, uniquely 
up to association, as a product of a finite number of irreducible polynomials over 
R. 

8. Prove Fermat’s last theorem for polynomials: if f, g, ¢ R[X] \ R have no 
nontrivial common factors, and are such that 


F(X)!" + g(X)" = h(X)", 


then n = | or 2. 
9. The purpose of this problem is to prove the partial fractions decomposition 
theorem. To this end, let K = Q, R or C, let f, g € K[X] \ {0} be given and 


think of £ as the rational function that sends each x € K \ g~!(0) to ‘ rt ; 


(a) If of < dg and g = gi'...g;* is the canonical decomposition of g in 
irreducible polynomials in K[X] (established in the previous problem if K = 
IR), prove that there exist f,..., fx € K[X], with fj = 0 or df; < d(g;") 
for 1 < j <k, and 


(b) If g is irreducible over K and k > 1 is an integer, prove that there exist 
polynomials q,r1,...,7¢ € K[X], withr; = 0 or dr; < dg forl < j <k, 
and 
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19.2 Unique Factorisation in Z[X] 


In the previous section, we studied the problem of unique factorisation for poly- 
nomials with rational coefficients. In this section, we extend the analysis of this 
problem to polynomials with integer coefficients. In this respect, the coming 
definition will be crucial for all that follows. 


Definition 19.10 The content of f € Z[X] \ {0}, denoted c(f), is the gcd of 
the nonvanishing coefficients of f. If c(f) = 1, we say that f is a primitive 
polynomial in ZX]. 


For the sake of notation, if f = a,X" +---+a,X + ao € Z[X] \ {0}, we shall 
write 


c(f) = ged(ao,..., Gn). 


The coming lemma, whose proof we leave to the reader as an exercise (cf. 
Problem 1), establishes two useful, albeit simple, properties of the content of 
polynomials with integer coefficients. 


Lemma 19.11 


(a) If f € ZX] \ {0} anda € Z \ {0}, then c(af) = |a| - c(f). In particular, there 
exists a primitive g € Z[X] \ {0} such that f = c(f)g. 

(b) If f € QX] \ {0}, then, up to multiplication by —1, there exist unique relatively 
prime nonzero integers a, b and g € Z[X] primitive such that f = (a/b)g. 


The importance of the concept of content for polynomials in Z[X’] lies in the key 
role it plays on the study of irreducibility of polynomials over Z, as we shall now 
define. 


Definition 19.12 A polynomial p € Z[X]\ Z is irreducible! over Z if p is 
primitive and cannot be written as the Ae of two nonconstant polynomials of 
ZX]. A primitive polynomial p € Z[X] \ Z which is not irreducible is said to be 
reducible over Z. 


As in the previous section, it is useful to rephrase the irreducibility condition 
for integer coefficient nonconstant polynomials contrapositively, so that a primitive 
polynomial p € Z[X] \ Z is irreducible if and only if the following condition is 
satisfied: 


p=egh, with g,heZ[xX)>g=+lorh=H1. (19.3) 


The coming result is known in mathematical literature as Gauss’ lemma. 


' Although this definition is slightly more restrictive than that usually found in most textbooks, it 
will be sufficient for our purposes. 
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Proposition 19.13 (Gauss) For f, g € Z[X] \ Z, we have: 


(a) c(fg) =c(f)c(g). In particular, fg is sbleee if and only if’ so are f and g. 
(b) If f is primitive, then f is irreducible in Z[X] if and only if it is irreducible in 


QL]. 
(c) If f and g are primitive and associated in Q{X], then f = +g. 
Proof 
(a) Lemma 19.11 shows that if f = c(f) fi and g = c(g)gi, then fi, g1 € Z[X] 


(b) 


(c) 


are primitive and fg = c(f)c(g)figi, so that c(fg) = c(f)c(g)c(figt). 
Hence, it suffices to show that | g1 is primitive, or, which is the same, that 


f, g primitive + fg primitive. 


If f is not primitive, then there exists a prime p € Z such that p divides all 
of the coefficient of f. Then, p divides all of the coefficients of fg, and fg is 
not primitive. Analogously, if g is not primitive, then fg is not primitive too. 

Conversely, assume that f(X) = a,X™” +---+a,X +o and g(X) = 
b,X" +-+++b,X + bo are both primitive, but fg is not. Take a prime p € Z 
such that p divides all of the coefficients of fg, and let k,/ > O be the smallest 
indices such that p { ax, bj (such k and / do exist, for otherwise f or g would 
not be primitive). Letting cy.) denote the coefficient of X**! in fg, we have 


Cel = +++ tag—abi42 + ag—1b]41 + gb + agg ibi-1 + ag-obi-2 +++. 


Now, since p | cg4; and p | ao,...,ak—1, bo, ..., bj-1, the above equality 
guarantees that p | a,b;, which is a contradiction. 

Implication <) is clear. For the converse, take f € Z[X]\ Z aur and 
suppose that f is reducible in Q[X], say f = gh, with g,h € Q[X]\Q. 
Lemma 19.11 assures that we can take a, b, c, d € Z\ {0} such that g = (a/b) g1 
and h = (c/d)h,, with gj, h, € Z[X] primitive. Then, 


bdf = bg - dh = ag, - ch, = acgjhy. 


However, since gih is primitive by (a), taking concn in both sides of the 
equality above we obtain |bd| - c(f) = |ac|. Therefore, # b ag = tc(f) € Zand, 
once again from the equality above, f = tc(f)gih1, ie., f is reducible in 
ZX]. 

If f = (a/b)g, with a, b € Z \ {0}, then bf = ag, and taking contents we get 
|b| -c(f) = lal - c(g). However, since c(f) = 1 and c(g) = 1, it comes that 
|a| = |b|, and hence a/b = +1. 


Oo 


>For another proof of this item, see Problem 6, page 469. 
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We can finally state and prove another theorem of Gauss, which for integer 
coefficient polynomials is the analogue of Theorem 19.9. 


Theorem 19.14 (Gauss) Every primitive polynomial f € Z[X] \ Z can be written 
as a product of a finite number of irreducible polynomials in ZX]. Moreover, such 
a way of writing f is unique up to a reordering of the factors and multiplication of 
some of them by —1. 


Proof Let us start by establishing the existence of a factorisation of f as a product 

of irreducible polynomials. To this end, looking at f as a polynomial in Q[X], 

Theorem 19.9 assures the existence of irreducible polynomials pj,..., pr € QLX]\ 

Q such that f = pi... pe. Write pj = (a;/bi)qi, with aj, bj € Z \ {0} relatively 

prime and qj € Z[X X]\ Z primitive. Since q; is obviously also irreducible in Q[X], 

item (b) of Gauss’ lemma guarantees that q; is irreducible in Z[X]. Letting a = 
1-...ay andb = by ... bg, we then have 


f = (a/b)q ..- 4k 


However, since qj, ..., qx are all primitive, item (a) of Gauss’ lemma guarantees 
that gq, ...q, iS primitive too. Thus, taking contents in the last equality above, we 
get 


|b| = |b] -c(f) = lal - cq... 4x) = lal. 


It comes that a/b = 
polynomials in Z[X]. 

The proof of the uniqueness part of the statement parallels that of Theorem 19.9, 
once we show the following claim: if p, f, g € Z[X]\Z are such that p is irreducible 
and p | fg in Z[X], then p | f or p | g in Z[X]. To this end, note firstly (from 
item (b) of Gauss’ lemma) that p is also irreducible in Q[X]. In turn, this being so, 
we already know that p | f or p | g in Q[X]. If p | f in Q[X] (the other case is 
analogous), there exists f; € Q[X] such that f = fi p. Choosing a,b € Z such 
that fi = (a/b) fo, with fo € Z[X] \ Z primitive, we have 


tl, and hence f = qi...qx, a product of irreducible 


bf = bfip = afop. 


Now, p and f2 primitive implies (once more from item (a) of Gauss’ lemma) f2 p 
primitive; taking contents in the equality above, we obtain |b|-c(f) = |a|-c(fop) = 
|a|. Hence, a/b = +c(f) € Z, so that f; € Z[X] and p | f in Z[X]. oO 


Gauss’ theorem allows us to define the greatest common divisor of two polyno- 
mials f, g € Z[X] \ Z. To this end, given f € Z[X] \ Z, start using Gauss’ theorem 
to write 


f=te(f) fp... FP 
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with nj,...,nx € Nand fi,..., fg € ZX] being irreducible, pairwise non asso- 
ciated and as positive leading coefficients. This is the canonical factorisation 
of f in Z[X 

Now, let € Z[X] \ Z have canonical factorisation g = +c(g)g}" aay e If 
h € Z[X]\ Zis irreducible, has positive leading coefficient and is such that h ij St & 
then there exists 1 <i < k and1 < j </ for whichh = f; andh = g;, so that 
fi = gj- In turn, this means that, up to a change of notation, we can write 


FSA eh aot aad. gS Been eh er ace? 


with hy,..., hx, fi,---, fr: 21, +--+ &s € ZX] \ Z irreducible, with positive lead- 
ing coefficients and pairwise non associated and}, ...,k, Pl, -++s Pls 1s -+-9 Ys € 
Z... We thus let, in Z[X], 


ged(f, g) = ged(c(f), c(g))h{! .. AY, 


with aj = min{n;, m;}, for1 <i <k. 


Problems: Sect. 19.2 


1. * Prove Lemma 19.11. 

2. * Let f € Z[X] be a monic and nonconstant polynomial. If f is reducible over 
Q, prove that there exist monic and nonconstant polynomials g,h € Z[X] such 
that f = gh. 

3. Let f, g € Z[X] \ Z be primitive. For a given common divisor d € Z[X] of f 
and g, show that the following are equivalent: 


(a) d = gcd(f, g). 
(b) If d’ € Z[X] divides f and g in Z[X], then d’ | d. 


19.3. Polynomials Over Z, 


We saw in Sect. 11.2 that, for a prime p ¢€ Z, the set Z, of congruence classes 
modulo p can be furnished with operations of addition, subtraction, multiplication 
and division quite similar to those of C. In turn, thanks to such a resemblance, 
essentially all of the concepts and results on polynomials studied so far remain true 
within the set Z,[X] of polynomials with coefficients in Zp. 

Our purpose here is to make explicit comments on some similarities and 
differences between polynomials over Z, and over K, with K = Q, R or C. In 
this sense, we shall leave to the reader the task of checking that all of the other 
definitions and results presented for K[X] (except those of Sects. 15.3 and 18.2 and 
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of Chap. 17) remain valid, ipsis literis, for Zp[X]. We also take the opportunity to 
deduce, with the aid of polynomials over Z,, some results on Number Theory and 
Combinatorics inaccessible by other means. In particular, we complete the proof 
of the existence of primitive roots modulo p, thus completing the discussion of 
Sect. 12.2. 

Given f(X) = a,X" +---+a,X + ao € Z[X], we define Z| X] to be the set 
of formal expressions f of the form 


f(X) =G,X" + +--+ 4,X +H, (19.4) 


where do, @1,..., 4» respectively denote the congruence classes of do, a1, ..., Gn 
modulo p. As before, such an f is called a polynomial over Zp. 
The correspondence f +> f defines a map 


Tp: ZX] — Z,[X] 


L => 7 


which is obviously surjective and is called the canonical projection of Z[X] onto 
Zp[X]. For f, g € Z[X], it is immediate to verify that 


F(X) = R(X) in Zp[X] 
3h € ZX]; f(X) = + ph(X) in Z[X]. 
Equivalently, letting 
pZ[X] = {ph; h € Z[X]}, 
we have 
f=08 f © pZ[X]. 


We extend the operations of addition and multiplication in Z, to homonymous 
operations +, -: Zp[X] x Zp[X] > Zp[X] by setting, for f, g ¢ ZLX], 


f+g=f+eg and f-g= fg. 


We leave to the reader the task of verifying the well definiteness of these operations 
in Z»[X], which can be done in a way entirely analogous to the well definiteness of 
the operations of Z p (cf. Sect. 11.2; see, also, Problem 1). 

As in Z[X], we say that a polynomial f € ZplX] \ {0} as in (19.4) has degree 
nif Gy, # 0, ie., if p { ay. More generally, if f ¢ Z[X]\ pZ[X], then f 4 0 and 
of < of. 
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The two coming examples use the multiplication of Zp[X] to establish interesting 
properties of binomial numbers. 


Example 19.15 If p € Z isa prime number and k € N, prove that (”) is a multiple 
of p, for every integer 1 < j < p*. 


Proof Example 6.42 shows that (X+1)? = X?+1inZ pLX]. Assume, by induction 
hypothesis, that (X + TP = xP +1in Zp[X] and for some / € N. Then, by 
successively applying the initial case and the inductive hypothesis, we obtain 


x ee 7 7 - 
(X4D"" = (K+ 0")? = (KP HT = (XP) ET a XP +. 
Thus, 
(X+1)" =x 47 


for every k EN. 
On the other hand, we also have 


(X4+ TD)" = xP + (n° jee Suave (‘ )x+ 1, 


= 2. : 
so that ("*) = 0 for every integer 1 < j < p*. In other words, p divides (7, i), Oo 


The next example makes use of the binary representation of natural numbers (cf. 
Example 4.12 of [8] or Problem 11, page 163). For a generalization, see Problem 11. 


Example 19.16 (Romania) Prove that the number of odd binomial coefficients in 
the n-th line of Pascal’s triangle is a power of 2. 


Proof Let 
n = 20 4+ 24-1 saa pe QAt + 240 


be the binary representation of n, with O < ay < aj < ... < ag. The previous 
example allows us to write, in Z2[X], 


(X40 =O4 Xs... e+" 
ck aes 7 (19.5) 
SOC Ge oe De 


Let S denote the set of natural numbers which can be written as a sum of (at 
least one of) distinct powers of 2, chosen from 2%, 2“—!,..., 2 and 2. By the 
fundamental principle of counting and the uniqueness of binary representation of 
naturals, we have |S| = 2+! — 1. On the other hand, by expanding the products of 
the last expression in (19.5), we get 
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(X47) = P= xo 1 (19.6) 


mes 


a sum with exactly 2‘+! summands. Now, by invoking once more the formula of 
binomial expansion, we have 


abn ane (Tat ttt ( " )X +1. (19.7) 
ims 


Therefore, by comparing (19.6) and (19.7), we conclude that exactly 2‘+! of the 


binomial numbers of the form (‘) (which compose the n-th line of Pascal’s triangle) 


satisfy () # 0, i.e., are odd. o 


In order to define the polynomial function associated to a polynomial f € Z plX], 
we have to take some care. Firstly, note that if f € Z[X] and a,b € Z satisfy 
a = b(mod p), then item (c) of Proposition 10.6 guarantees that 


f(@ = f(b) (mod p); 


on the other hand, if f = g in Z,[X], we saw above that there exists h € Z[X] such 
that f(X) = g(X) + ph(X). Hence, fora € Z we have 


f(@ = g(a) + ph(a) = g(a) (mod p). 


Given f € Z pL|X], the above comments allow us to define the polynomial 
function f : Z p > Zp by setting, fora € Z, 


f@=2@), (19.8) 


where g € Z[X] is any polynomial for which f = %. Obviously, the image of f 
is a finite set, for Z p is itself finite. From now on, whenever there is no danger of 
confusion, we shall write (19.8) simply as 


f@ =f. 


The coming example shows that, contrary to what happens with polynomials 
over Q, R or C, the polynomial function associated to a nonzero polynomial over 
Zp[X] can vanish identically. In other words, it is no longer valid that two distinct 
polynomials over Z, have distinct polynomial functions. 


Example 19.17 The polynomial f(X) = X? — X € Z)[X] is clearly a nonzero 
element of Z,[X]. On the other hand, letting f : Z,) — Zp» denote its associated 
polynomial function, Fermat’s little theorem gives 
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f@ =a? —a=aP—a=0 


for every a € Z,. Thus, f vanishes identically. 


Let f € Z[X] anda € Z be given. As in Sect. 15.1, we say that @ € Zp is a root 
of f provided f(a) = 0. An easy review of the proof of the root test shows that 
it continues to hold in Z,[X]. In particular, from the above example we obtain the 
following important result. 


Proposition 19.18 Jn Z,[X], we have 
eH 1S He, A= GD); 


Proof Since 1.2.4.5; p-1 are roots of X?~! — Tin Zp» (from the last example), 
item (c) of Proposition 15.3 assures that the polynomial X?~! — 1 is divisible by 
(f= 1 =?) 21% = (p — 1)) in Z,[X]. However, since both such polynomials 
are monic and have degree p — 1, they are actually equal. Oo 


In Zp[X], all of the definitions and results of Sect. 19.1 remain true. In particular, 
we can state the following theorem, whose proof is entirely analogous to that of 
Theorem 19.9. 


Theorem 19.19 [f p € Z is prime, then every polynomial f € Zp[X]\ Zp can 
be written as a product of a finite number of irreducible polynomials over Zp. 
Moreover, such a decomposition of f is unique up to association and reordering 
of the irreducible factors. 


As a first application of the previous result, we shall now prove the existence of 
primitive roots modulo p. For the statement and proof of the coming result, perhaps 
the reader might want to review the material of Sect. 12.2. 


Theorem 19.20 Jf p € Z is an odd prime and d is a positive divisor of p — 1, then 
the algebraic congruence 


x?-! _ | = 0(mod p) (19.9) 


has exactly p(d) roots of order d, pairwise incongruent modulo p. In particular, p 
has exactly ~(p — 1) primitive roots pairwise incongruent modulo p. 


Proof For a positive divisor d of p — 1, let N(d) denote the number of roots of 
the algebraic congruence (19.9) which are pairwise incongruent modulo p and have 
order d. Since the roots of (19.9) are the integers 1,2,..., p — 1, and since by 
Proposition 12.2 each of these numbers has order equal to a divisor of p — 1, we 
conclude that 


Y> N@)=p-1. 


0<d|(p—1) 
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If we show that N(d) < g(d), Proposition 8.11 will give 


p-l1= Yi M@< YO o@= 


0<d|(p—1) 0<d|(p—1) 


Therefore, we will obtain V(d) = g(d) for every positive divisor d of p — 1. 
Then, let d be a positive divisor of p — 1. If N(d) = 0, it is clear that N(d) < 
y(d). Otherwise, let ia be an integer of order d modulo p; then, the congruence 
classes 1, a, a € Zp are pairwise distinct roots of Mia=TEZ p[X]. On the 
other hand, Corollaey 15.8 paamanibes that such a polynomial has at most d distinct 
roots in Zp, so that its roots are exactly 1, a, ..., a‘—!. Therefore, if a € Z is 
a root of order d of (19.9), then @ € Zp is a root of xd —le Zp[X], so that 
@ e211, Gxcus aé ae We conclude that the roots of order d of (19.9) are precisely 
the elements of order d of the set {1,a,..., at} and, hence, 


N(d) = #0 <k <d—1; ord,(a*) = d}. 


Item (c) of Proposition 12.5 counts the number of elements of the set in the right 
hand side above: since ord,(a) = d, we have 


d 
ord,(a*) =d © ad. = d & ged(d,k) = 1. 


Therefore, 


#0 <k <d—1; ord,(a*) = d} =#{0<k <d—-1; ged, = 1} 
= 9(d). 


The above argument has shown that either N(d) = 0 or ¢g(d), so that, in any 
case, we have N(d) < y(d), as wished. 

For what is left to do, it suffices to note that N(p — 1) = g(p — 1), so that 
there are exactly g(p — 1) integers, pairwise incongruent modulo p and with order 
Pp — 1 = (Pp) modulo p. In other words, there are precisely y(p — 1) pairwise 
incongruent primitive roots modulo p. (For the sake of completeness, it is worth 
noticing that this result agrees with that of Proposition 12.8.) oO 


We finish this section by exhibiting yet another beautiful application of the theory 
of polynomials over Z, to Number Theory. 


Example 19.21 (Miklés-Schweitzer) If p > 3 is a prime number satisfying p = 
3 (mod 4), prove that 


I] (x? + y*) = 1 (mod p). 
lsx¢ys2t 
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Proof In all that follows, without further mention, indices in products will vary from 
1 to — 
If P stands for the product at the left hand side, then 


P =| [@?+1)-[]@?+2)-...- ]] (7+(2*)) 


Al xA2 xZ2et 


2 


| Tye? +) -T]e? +2). T+ (2 )) 
= ae, é 


Now, for each integer | < k < =, let cy, denote its inverse modulo p. Then, 
modulo p we get 


[[@?4+?) se] ] (ax +d =e TQ? +b. 


Indeed, the first congruence is panera and, for the second one, it suffices to 
observe that {(cyx)*; 1 < x < Phi is a Set of Pe ; 
residues modulo p; hence, also sroduilé p, we have 


2 


{(ckx)?; l<x< 2} = tes 1l<x< aI. 


It then follows that, modulo p, 


DF PBs (PES) Pat Pine (Po* = (To? +) 7 


or, which is the same, 


“P= ([]@?+ »)* 


Let a bea peumtive root modulo p and Q = TG? +1). Since {a**; 1 < k < 
Bot) is a set of Co | pairwise incongruent quadratic residues modulo p, we have 
fark, L<k< est) = {x22 1<x< P=!) modulo p, so that 


O= | [@* +1) and 2° P= Q* (mod p). (19.10) 
In order to compute the residue of Q, modulo p, let f € Z[X] be defined by 


f(X) = (X —a?)(X — a4)... (X — a? 74), 
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so that 


p-l 


Q=(-1)? f(-l) =—f(-]) (mod p). 


(Here we have used the fact that p = 3 (mod 4) in the second congruence above.) 
Now, observe that 


-1 


FRSC HOE HO) X= oF \(X+a)X+02)...(K+ar). 


For 1 <i,j < aad ifa! = —a/ (mod p), then att = QJ (mod p) and, since a 
is a primitive root modulo p, we obtain 2i = 2), thus i = j; however this being 
the case, we would have a’ = —a' (mod p), which is an absurd. Then, the set 


{+a, a, ..., ws} is a RRS modulo p, so that it coincides, modulo p, with 
{1,2,..., p — 1}. Therefore, letting f € Zp[X] denote the image of f by the 
projection of Z[X] onto Z,[X] and invoking the result of Proposition 19.18, we 
arrive at 


Fie) Ss & - DSK =2).. (Xk — p —H Sake 1 
and, hence, at 
f= xX -T. 


However, since p = 3 (mod 4), this gives 


ie., Q =2(mod p). 


p—1 
Finally, substituting this information in (19.10), we get the congruence 2°T P= 


im 


25 (mod p), and hence P = 1 (mod p). 


Problems: Sect. 19.3 


1. * Given a prime number p, check the well definiteness of the operations of 
addition and multiplication in Z»,[X]. Also, show that such operations are 
associative and commutative, have identities respectively equal to the constant 
polynomials 0 and 1, and that multiplication is distributive with respect to 
addition. 

2. Let p > 2 bea given prime. Find, if any, the roots of X?=! +1 € Zp[X]. 
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3. 


10. 


11. 


* Given f € Z[X] and an integer root a of f, prove that a € Zp is a root of 
f € Z)[X]. In particular, conclude that if @1,..., a, € Zp are the roots of f, 
then there exists 1 < j < k such that a = a; (mod p). 


. Show that f(X) = X? — 15X? + 10X — 84 € Z[X] has no rational roots. 
. *If p € Zis prime and f € Z[X], prove that f(X?) = f(X)?. 
. * Use the projection z : Z — Z, to prove that if f, g ¢ Z[X] \ Z are primitive 


polynomials (cf. Definition 19.10), then so is fg. 


. Let p > 2 bea prime number and | < d < p — | bean integer. 


(a) If d { (p — 1), show that X¢ — T has no roots in Z p(X]. 
(b) If d | (p — 1), factorise X¢—Tin Zp[X]. 


. * Let p > 3 bea prime number and, for 1 < j < p—1,lets;(1,2,..., p—1) 


denote the j-th elementary symmetric sum of the natural numbers 1, 2,..., p— 
1. Prove that?: 

(a) For 1 < j < p—2, we have s;(1,2,..., p— 1) =0 (mod p). 

(b) sp_1(1, 2,..., p— 1) = —1 (mod p). 


. * Ifa, b and c are the complex roots of the polynomial X* — 3X? + 1, show 


that a” +b" +c” € Z for every n € N, and that such a sum is always congruent 
to 1 modulo 17. 

(France) For a given n € N, let J, denote the number of odd coefficients of the 
polynomial (X? + X + 1)”. 

(a) Compute Jo, form € Z+. 

(b) Show that, form € N, we have 


gm+l ne (— yy7tl 


Inm_| = 3 


For the coming problem, the reader may find it useful to review the statement 
of Problem 11, page 163. 
Prove Lucas’ theorem: given natural numbers m > n and a prime number p, 
if 


k k 
m=)" mjp! and So njp! 
j=0 j=0 


are the representations of m and n in base p, then 
k 


()=f1(i) men 


j=0 


3Note that item (b) provides another proof of part of Wilson’s Theorem 10.26. 
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In particular, conclude that: 


(a) p | ("") if and only if mj < nj; for some 0 < j <k. 
(b) Exactly (mo + 1)Qm, + 1)... (mx + 1) binomial numbers of the form (") 
are not divisible by p. 


(c) No binomial number of the form (? oe ~') is divisible by p. 


12. Let p be a prime number and k € N. Prove that 


p§(p—V)\ _ [ (-1)4 (mod p), if 1 = p*g; 0<q<p-l,qeZ 
l ~ | O(mod p), otherwise. , 


19.4 More on Irreducible Polynomials 


Up to this moment, we do not have at our disposal any method for establishing 
the irreducibility of a given polynomial. Obviously, in particular cases one can 
sometimes use the direct method, which consists in writing the given polynomial 
f (say, in Z[X]) as a product of two other polynomials g,h € Z[X], and then 
of solving the resulting system of equations in the coefficients of g and h, thus 
obtaining a nontrivial factorisation for f or reaching a contradiction. We illustrate 
such a procedure in the coming example (see, also, Problems | and 2). 


Example 19.22 Prove that f(X) = X*++ 10X3+5X + 1993 is irreducible over Q. 


Proof By Gauss’ lemma (cf. Proposition 19.13), it suffices to show that f cannot 
be written as a product of two nonconstant polynomials with integer coefficients. 
We now observe that 1993 is prime, a fact that can be easily established with the 
aid of Eratosthenes’ sieve, for example. Hence, the rational roots test implies that 
the possible integer roots of f are +1 or +1993, and direct checking shows that 
none of these is actually a root. 
We are left to discarding the possibility of f having a factorisation of the form 


F(X) = (X? +aX +b)(X? + eX +d), 


with a,b,c,d € Z. If such happens, then, by expanding the products at the right 
hand side and comparing coefficients, we obtain the following system of equations: 


a+c=10 
ac+b+d=0 
ad+be=5 © 
bd = 1993 


The fourth equation, together with the primality of 1993 and the symmetry of 
the factorisation (two factors of degree 2) give (b, d) = (1, 1993) or (—1, —1993) 
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as the only essentially distinct possibilities. From them, the first and third equations 
furnish the systems of equations 


a+c=10 a+c=10 
a+1993c =5 a+1993c = —5 ° 
Finally, it is immediate to check that none of them has integer solutions. oO 


As testified by the calculations performed in the example above, the use of the 
direct method to check that a given polynomial f € Q[X] is irreducible over 
Q is likely to face considerable computational difficulties, even in the case of f 
having integer coefficients and small degree. Therefore, it would be quite nice to 
develop a few techniques that could be effectively applied to the investigation of the 
irreducibility of polynomials. In this sense, we start with the following result. 


Proposition 19.23 Let f € Z[X] be monic, and p € Z be prime. 


(a) Iffe Z p[X] is irreducible in Zp[X], then f is irreducible in Z[X]. 
(b) If f(X) = (X — a)g(X), with g € Zp[X] irreducible, then either f is 
irreducible in Z[X] or it has a root « € Z such that a = a (mod p). 


Proof 


(a) By contraposition, suppose f(X) = g(X)h(X), with g,h € Z[X] \ Z monic 
polynomials. Then, f(X) = @(X)h(X), with 3g = dg and dh = dh. 

(b) Suppose f to be reducible, say f(X) = h(X)/(X), with h,/ € Z[X] \ Z being 
monic. Then, 0h = dh, 01 = 41 and 


(X — @)g(X) = h(X)I(X). 


However, since g is irreducible in Z plX], Theorem 19.19 assures that either h 
or / must be associated to X — a, so that 0h = 1 or 0] = 1. 

Assume, without loss of generality, that dh = 1. Then, h(X) = X — a, for 
some a € Z, so that is an integer root of f. Finally, since h(X) = X — a (for 
h is monic and associated to X — a), it comes that @ = a. 

Oo 


Example 19.24 Let p € Z be prime, with p = 5(mod6), and k € N be such that 
kp + 1 is also prime. Prove that f(X) = X*? + pX* + pX + kp + 1is irreducible 
in Z[X]. 


Proof Projecting f into Z,[X], we obtain 
F(X) = X341=(X%4+ D0? - X41). 


We now claim that —1 is the only root of f in Z p- Indeed, if f(@ = 0, then 
a> = —1, so that a? = —1 (mod p). Thus, a® = 1 (mod Pp), and hence (by Fermat’s 


little theorem, together with item (b) of Proposition 12.2) 


ord p»(a) | gced(6, p — 1) = 2. 
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Therefore, ord »(a) = 1 or 2. Since ord p(a) =1>a= 1, which is not a root of 
f, we must have ord »(a) = 2. Then, a~ = 1 (mod p) and a ¥ | (mod p), so that 
a = —1 (mod p) or, which is the same, a = —1. 

Since 3 4 0 in Z p» the above claim guarantees that X 2 _ X + Thas no roots in 
Zp[X]; thus, it is irreducible in Z,[X]. The previous proposition then assures that 
either f is irreducible in Z[X] or it has a root a € Z such that a = —1 (mod p). 
However, the searching criterion for rational roots assures that an integer root a of f 
must be a divisor of kp + 1, which is prime. Then, either a = —1 ora = —(kp+1), 
and testing these possibilities we conclude that none of them is actually a root of f. 
Thus, f is irreducible. Oo 


Remark 19.25 For a given p € N, we shall prove in Sect. 20.2 that there exist 
infinitely many k € Z for which kp + | is prime. 


The corollary of the coming result of F. Eisenstein, known in mathematical 
literature as Eisenstein’s criterion, will show to be a powerful ally in establishing 
the irreducibility of certain polynomials of integer coefficients. 


Theorem 19.26 (Eisenstein) Let f(X) = a,X" +---+a,X +o be a polynomial 
with integer coefficients and degree n > |. Let also p € Z be prime and\1 <k <n 
be an integer such that: 


(a) p|ao,41,..., ak. 
(b) p> tao and p{ ay. 


If f = gh, with g,h € Z[X], then max{dg, 0h} > kK +1. 
Proof Letting g(X) = b-X" +.---+ bo and h(X) = c,X* +--+ +0, we have 
ay = bycs and ag = boco. Therefore, 


Pt Gn => pt brcs => pt by and pf cs, 


so that, in Zp»[X], we have f(X) — 2(X)h(X), with dg = dg =r and dh =dh=s. 
If X | g(X) and X | h(X) in Z)[X], then p | bo and p | co in Z, so that 
p | boco = ao, which is not the case; hence, in Z,[X], the polynomial X divides 
at most one of g or h. 
Assume, without loss of generality, that X does not divide h. Since (by 
hypothesis (b)) 


B(X)A(X) = F(X) = Gy X" +--+ + Gy XO 


= Ce ae +++ Gee) X**1, 


we thus conclude that X*+! | g(X). Thus,r >k+ 1. oO 
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Corollary 19.27 (Eisenstein) Let f(X) = a,X"+---+a,X +d be a polynomial 
with integer coefficients and degree n > 1. If there exists p € Z prime such that 
p | op Gis 402 pMtgnie BD { ag and p ¢ ay, then f is irreducible in Q{X]. 


Proof By Eisenstein’s theorem, if f(X) = g(X)A(X), with g,h € Z[X], then 
either 0g > n or Oh > n, ie., either g or h is constant. Thus, f cannot be written 
as a product of two nonconstant polynomials with integer coefficients, and Gauss’ 
lemma guarantees that f is irreducible in Q[X]. oO 


The coming example brings a classical application of Eisenstein’s criterion, 
which will be obtained by other means in Sect. 20.2. 


Example 19.28 If p € Zis prime and f € Z[X] is given by 
FSP ey Baie Kd, (19.11) 


then f is irreducible in Q. 


Proof We start by noticing that f(X) = g(X)A(X) if and only if f(X + 1) = 

g(X + DA(X + 1), with g(X) and g(X + 1) (as well as h(X) and h(X + 1)) having 

equal degrees. Thus, it suffices to show that f(X + 1) is irreducible in Q[X]. 
Since (X — 1) f(X) = X? — 1, we obtain 


KAK+ = OLE DP —1= xP+ (xP et [ mi 
i= 


and hence 
f(X+1=xP ty (P)xr eset an 


Now, note that (a) = p and recall, from Example 6.42, that p | C) for 1 < 
k < p—1. Hence, we can apply Eisenstein’s criterion with the prime p to conclude 
that f(X + 1) is irreducible in Q[X]. oO 


Our second example uses the full force of Eisenstein’s theorem. 


Example 19.29 (IMO) Given a natural number n > 1, prove that the polynomial 
X" +5X"—! 4 3 is irreducible in Z[X]. 


Proof Let X" + 5x™143= g(X)h(X), with g,h € Z[X] monic. With p = 3 in 
Eiseinstein’s theorem, we conclude that 


max{dg, dh} >n—1, 


so that either f is irreducible or it has an integer root. 
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On the other hand, if this last possibility takes place, then the searching criterion 
for rational roots assures that such a root is +1 or +3. However, direct computation 
shows that none of these four numbers is a root of the given polynomial. Oo 


In spite of the theory developed above, we frequently establish the irreducibility 
of a given polynomial by invoking ad hoc methods. Let us take a look at two such 
examples. 


Example 19.30 (Romania) Let f € Z[X] be a monic polynomial of degree n > 1 
and such that f(0) = 1. If f has at least n — 1 complex roots of modulus less than 
1, prove that f is irreducible in Q[X]. 


Proof By Gauss’ lemma, it suffices to show that f is irreducible in Z[X]. To this 
end, and for the sake of contradiction, assume that f = gh, with g,h € Z[X] \ Z. 
Without any loss of generality, we can take g and h to be monic; moreover, from 
g(0)h(O) = 1 we get g(0), A(O) = +1. On the other hand, Girard-Viéte relations 
(cf. Proposition 16.6) assure that the modulus of the product of the complex roots of 
f is equal to 1, so that the hypotheses of the example guarantee that f has exactly 
one complex root of modulus greater than 1. Hence, one of the polynomials g or h, 
say g, has only complex roots of modulus less than 1. However, since g is monic and 
|g(0)| = 1, by invoking Girard-Viéte relations again, we conclude that the modulus 
of the product of the complex roots of g is equal to 1, which is a contradiction. O 


Remark 19.31 Some complex analysis assures that polynomials f € Z[X] satisfy- 
ing the hypotheses of the previous example can be easily constructed. For instance, 
let f(X) = anX”" + an_,X"-! +--+ +.a,X + ag € Z[X] be such that 


lan—1| > |@n| + |an—2| + lan—3| +--+ + lal. (19.12) 


If |z| = 1, then the triangle inequality for complex numbers gives 


24... +aiz+ aol 


-1 = 
| f(z) — Gn—12" |= lanz” + An—22" 
< |an| + |an—2| +--+ + lai] + laol 


-1 
< |an—-1| = |an—12" |. 


In particular, f does not vanish on the unit circle |z| = 1 of the complex plane, and 
Rouché’s theorem (cf. [11], page 125) assures that f has exactly n — 1 complex 
roots of modulus less than |. Of course, the extra requirement that a, = ag = 1 
does not preclude the validity of (19.12). 


Example 19.32 (IMO Shortlist) Let n > 1 be an odd integer and aj,..., da, be 
pairwise distinct given integers. Show that 


f(X) = (X — a1)(X — az)... (X — an) — 1 


is irreducible in Q[X]. 
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Proof Once more Gauss’ lemma reduces the problem to showing that f cannot be 
written as the product of two nonconstant polynomials with integer coefficients. By 
contradiction, assume we have f = gh, with g and fA being nonconstant, monic 
polynomials with integer coefficients. Since 


g(aji)h(a;) = f(aj) = -1 


and g(aj), h(a;) € Z, we must have g(a;) = 1 and h(a;) = —1, or vice-versa; in 
any event, it happens that g(a;) + h(a;) = 0. Now, let f; = g +h. Since g andh 
are monic, f; does not vanish identically. Therefore, 


of; < max{dg, dh} < of =n, 


so that f; has at most df; < n complex roots, counted according with their 
multiplicities. However, since f\(a;) = 0 for 1 <i <nandaj,..., dy, are pairwise 
distinct, we have arrived at a contradiction. oO 


Problems: Sect. 19.4 


1. Use the direct method described in the beginning of this section (and applied in 
Example 19.22) to prove that X* — X? + 1 is irreducible in Z[X]. 

2. * Also with the aid of the direct method, show that f(X) = X°> — X*-—4x3 4 
4X? + 2 is irreducible over Q. 

3. A real polynomial is said to be positively reducible if it can be expressed as a 
product of two nonconstant real polynomials with nonnegative coefficients. Let 
f be areal polynomial such that, for some n € N, the polynomial f(X”) is 
positively reducible. Show that f is also positively reducible. 

4. (IMO shortlist) Let n > 1 be an integer and aj,..., da, be pairwise distinct 
positive integers. Prove that 


F(X) = (X — ay)(X — az)...(X — an) +1 


is reducible in Q[X] only in the following cases: 


i, n= 2and f(X) = (X —a)(X —a—-2)+1. 
ii, n =4and f(X) = (X —a)(X —a—1)(X —a-2)(KX -—a—-3) 41. 


5. Let p be an odd prime, f(X) = 2xXP?-!_1 and g(X) = (X-1)(X—-2)... (X- 
p+1). Prove that at least one of f(X)— g(X) or f(X)—g(X)+ p is irreducible 
in Z[X]. 

6. Let p € Z be a prime number and f(X) = Mins XO +++ bayX” +--°+ 
a,X + ao be a polynomial with integer coefficients and satisfying the following 
conditions: 
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10. 


11. 


12. 


(a) pp | dg, d],---,y- 
(b) p | Qn41, n42,.--, 2n- 
(c) p> | ao and pt don41- 


Show that f cannot be written as the product of two nonconstant polynomials 
with integer coefficients. 
Let p be a prime number and k € N. Prove that* 


f(X)= xP (-)) a xP | (p-2) Avaeak ae +1 


is irreducible over Q. 

(Romania) Let p € Z be a prime number. If f(X) = ayn X" +an_1X"-!4---+ 
a,X + ao € Z[X] is such that |a9| = p and |a,| + lan—1| +---+ lal < p, 
prove that f is irreducible over Q. 

(Romania) Let f(X) = a,X”" + +--+ a,X + ao be a nonconstant polynomial 
with integer coefficients. If |ap| > |a1|+|a2|+---+lan| and /ao] < /Jan|+1, 
prove that f cannot be written as the product of two nonconstant polynomials 
with integer coefficients. 

(Romania) Let N = A,;UA2U...UAx, be a partition of the set of natural numbers 
into k nonempty sets. Given m € N, prove that there exists | < j < k for 
which we can construct infinitely many polynomials f satisfying the following 
conditions: 


i. of =m. 
ii. All of the coefficients of f belong to A;. 
iii. f cannot be written as the product of two nonconstant polynomials with 
integer coefficients. 


Let f be a polynomial with integer coefficients and degree n > 1, and z),..., 
Zn be the complex roots of f. 


(a) If Re(z;) < Ofor 1 < j < 21, prove that all of the coefficient of f have a 
single sign. 

(b) Prove Pélya-Szegé’s theorem”: assume that there exists m € Z such that 
f(m) is prime, f(m— 1) # Oandm > 5 +Re(z;), for 1 < j <n. Then, f 
cannot be written as a product of two nonconstant polynomials with integer 
coefficients. 


5 


(BMO) If the coefficients of a prime number are taken as the coefficients of 
a polynomial f € Z[X], prove that f is irreducible over Z. More precisely, 
let n > 1 be an integer and ag, aj, ..., G,—1, a, be natural numbers such that 
a, # OandO0 < a; < 9 for0 <i <n. If f(10) is prime, prove that f is 
irreducible over Z. 


4As Problem 1, page 492, will show, the result of this problem is a particular case of Theo- 
rem 20.18. 


5George Pélya and Gabor Szeg6, Hungarian mathematicians of the twentieth century. 


Chapter 20 Mm) 
Algebraic and Transcendental Numbers spooks 


We start this chapter by inverting the viewpoint of Chap. 15. More precisely, we 
fix a complex number z and examine the set of polynomials f € C[X] for which 
f(z) = 0. As a byproduct of our discussion, we give a (hopefully) more natural 
proof of the closedness, with respect to the usual arithmetic operations, of the set of 
complex numbers which are roots of nonzero polynomials of rational coefficients. 
We then proceed to investigate the special case of roots of unity, which leads us to 
the study of cyclotomic polynomials and allows us to give a partial proof of a famous 
theorem of Dirichlet on the infinitude of primes on certain arithmetic progressions. 
The chapter closes with a few remarks on the set of real numbers which are not roots 
of nonzero polynomials with rational coefficients. 


20.1 Algebraic Numbers Over Q 


A complex number «@ is said to be algebraic over Q if there exists a polynomial 
F € QLX] \ {0} such that f(@) = 0. A complex number which is not algebraic over 
Q is said to be transcendental over Q. In this section and the next one we stick to 
the case of algebraic numbers, leaving a few word on transcendental numbers to be 
said in Sect. 20.4. 

Obviously, every rational number r, being a root of the polynomial X —r € 
QLX] \ {0}, is algebraic over Q. In turn, the coming example collects less trivial 
instances of algebraic numbers over Q. 


Example 20.1 Letr € Qi andn € N. If @ is an n-th root of unity, then Y/r@ 
is algebraic over Q, for such a number is a root of the nonzero polynomial with 
rational coefficients X” — r. 


We could have also defined, in an obvious way, what one means by a complex 
number a to be algebraic over IR. Nevertheless, such a concept would not be 
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interesting, for every complex number is algebraic over R. Indeed, give a nonreal 
complex number a = a + ib, we have that @ is a root of the nonzero polynomial 


f(®%) =(X — (a+ bi))(X — (a — bi)) 
= X* —2aX + (a* +b’) € R[X]. 
For this reason, whenever we consider an algebraic number over Q, we shall simply 


refer to it as an algebraic number: 
If a complex number a is algebraic, the set 


Aa = {f € QLX] \ {0}; f(a) = 0} 


is nonempty by definition. Then, it is also nonempty the set of nonnegative integers 
{af; f € Ag}, so that there exists py € Ay, monic and of minimum degree. We 
thus have the following 


Definition 20.2 Given a complex number @ algebraic over Q, a polynomial py € 
Q[X] \ {0}, monic, of minimum degree and having a as a root is called a minimal 
polynomial for a. 


The coming proposition and its corollaries collect the most important properties 
of minimal polynomials of algebraic numbers. 


Proposition 20.3 [fa € C is algebraic over Q and py is a minimal polynomial of 
a, then: 


(a) Pa is irreducible over Q. 
(b) If f € Q(X] is such that f (a) = 0, then pa | f in QLX]. 


In particular, Py is uniquely determined by a. 
Proof 


(a) If we had py = fg, with f and g being nonconstant and of rational coefficients, 
then the degrees of f and g would be less than that of py and at least one of 
them would have a as a root. In turn, this would contradict the minimality of 
the degree of py. Therefore, py is irreducible over Q. 

(b) By the division algorithm, there exist polynomials g, r € Q[X] such that 


F(X) = palX)q(X) + r(X), 
with r = 0 or 0 < Or < Opy. Ifr ¥ 0, then 
r(a) = f(a) — pala)q(a) = 0, 


with dr < dpq, and this would again be a contradiction to the minimality of the 
degree of py. Thus, r = 0 and, hence, py | f in Q[X]. 


'Nevertheless, we shall briefly consider algebraic numbers over Zp in Sect. 20.3. 
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Finally, if py and gq were minimal polynomials for a, then item (b) would 
give Pa | da in Q[X]. However, since py and gq are both monic and of the same 
degree, it would come that py = qa- 

oO 


Thanks to the former proposition, given a € C algebraic, we can refer to py as 
being the minimal polynomial of a. 


Corollary 20.4 [fa € C is algebraic and f € Q[X] \ {0} is a monic, irreducible 
polynomial such that f (a) = 0, then f = Pa. 


Proof By the previous result, py divides f in Q[X]. However, since f is irreducible, 
there must exist a nonzero rational number c such that f = cpy. Finally, since f 
and py are both monic, we must have c = 1. oO 


Corollary 20.5 [f f € Q[X] \ Q is irreducible, then f has no multiple roots. 


Proof We can assume, without any loss of generality, that f is monic. If some a € 
C is a multiple root of f, then Proposition 15.31 would guarantee that a is also 
a root of the derivative f’ of f. However, since f € Q[X] \ {0} is monic and 
irreducible, Corollary 20.4 assures that it is the minimal polynomial of a. Therefore, 
Proposition 20.3 gives f | f’ in Q[X], which is a contradiction to the inequality 
af > of’. Oo 


We now collect some interesting applications of Proposition 20.3 and its 
corollaries. 


Example 20.6 (IMO Shortlist) Let f be a nonconstant polynomial of rational 
coefficients and a a real number such that a? — 3a = f(a)? —3f(@) = -l. 
Prove that, for every positive integer , one has 


f™@)? -3f™@) =-1, 


where f stands for the composite of f with itself, n times. 


Proof If g(X) = X? — 3X +1, then dg = 3 and, searching rational roots (cf. 
Proposition 15.16), we conclude that g has none. Hence, Problem 5, page 457, 
guarantees that g is irreducible over Q, and Corollary 20.4 assures that g is the 
minimal polynomial of a. 

On the other hand, since we are assuming that the polynomial g o f also has a 
as a root, Proposition 20.3 show that g divides g o f in Q[X], say (g o f)(X) = 
g(X)u(X), for some u(X) € QLX]. 

Finally, note that we have g( f(a)) = 0 by assumption. Hence, if we have proved 
that g(f(a)) = 0 for some k > 1, then 


BC f'@)) = (80 NU @) = a(f(a)u( f(a) = 0, 


and there is nothing left to do. Oo 
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Example 19.28, in en with Corollary 20.4, immediately assures that if 
Pp is prime and w = cis 2, then the minimal polynomial of @ is 


Po(X) = XP 4 xP? 4.4 X41, 


The coming example uses this fact to give a beautiful alternative proof of Exam- 
ple 11.5, due to the Bulgarian mathematician N. Nikolov. 


Example 20.7 (IMO) Let p be an odd prime. Compute how many are the p-element 
subsets of the set {1, 2,..., 2p} such that the sum of its elements is divisible by p. 


Solution If @ = cis 2 , then w? = | and, hence, 


Dp 
= [|[«-o’) [a -o) 


j=l j=! 


Pp 2p 
=[[@-o/ [] «-o’) 
j=l j=ptl 


= =|[|[« _ w!). 
j=1 


Computing the coefficient of X? at both sides of the equality above and recalling 
that p is odd, we conclude that 


= > iit tip, (20.1) 
{f1,---Jp}Clop 
with the above sum ranging through all p-element subsets {j1,..., jp} of lop = 


{1,2,...4; 2p}. 

On the other hand, if for 0 < k < p — 1 we let cg denote the number of p- 
element subsets {j1,..-, jp} C ap for which jj +--+ + jp = k (mod p), then 
w? = | assures that the right hand side of (20.1) can be written as YS cho*, so 
that 


pol 


> cpok = 2. 
k=0 


It follows from what we did above that w is a root of the polynomial with integer 
coefficients f(X) = = : =p te — 2. Note also that, since 


p-1 p+l 
2° 2 


[1. p 18, pr A ree ses ,2p | 
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has p elements and sum of elements congruent to p — 1 modulo p, we have cp_1 # 
0. Hence, df = p—1. 

Now, since the minimal polynomial of w is py(X) = X?~!'+---+X +1, item 
(b) of Proposition 20.3 guarantees that f is an integer multiple of p,, say f = cPw, 
for some c € N. In particular, by comparing coefficients on both sides of such an 
equality, we conclude that 


co-2=cy S++ =Cp-1 =. 


Finally, in order to compute the value of c, note that co +c, + +--+ €p—1 is equal 
to the total number of p-element subsets of Jz), so that 


2p 
pet2=egote +--+ +ep1= ag 


Therefore, 
1 2p 
CO —— 9 & + 2 —— — 2: + De 
P Pp 


Example 20.8 (Romania) Let f € Z[X] be a monic polynomial, of odd degree 
greater than 1 and irreducible over Q. Suppose also that: 


oO 


(a) f(O) is square-free. 
(b) The complex roots of f have modulus greater than or equal to 1. 


Prove that the polynomial F € Z[X], given by F(X) = f(X 3). is also irreducible 
over Q. 


Proof By contradiction, assume that F is reducible over Q. Then, it follows from 
Problem 2, page 461, that there exist monic and nonconstant polynomials g,h € 
ZX] such that F = gh. Since dF = 30f and df is odd, we conclude that dF is 
also odd. Hence, Problem 5, page 386, guarantees that F has at least one real root a. 

We can assume, without loss of generality, that a is a root of g and that g is 
irreducible over Q. Indeed, if w is a root of g but g is reducible over Q, it suffices 
to take the monic irreducible factor of g having @ as a root, and then to use the 
result of Problem 2, page 461 to conclude that such an irreducible factor has integer 
coefficients. 

In the conditions of the previous paragraph, we know that g is the minimal 
polynomial of a. Now, if @ # | is a cubic root of 1, then 


0 = F(a) = f(a?) = f((aw)*) = F(a) = g(aw)h(ao). 
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We therefore distinguish two cases: 


(i) g(awm) = 0: since (cf. Problem 4, page 386) the nonreal roots of a real 
polynomial come in pairs complex-complex conjugate, we have that @ = aw” 
is also a root of g. Now, by grouping together in the expression of g all 
monomials with exponents of each one of the forms 3k, 3k + 1 and 3k + 2, 
we can write 


g(X) = a(X*) + Xb(X?) + Xe(X4), (20.2) 


with a, b,c € Z[X]. Therefore, 
a(a*) + ab(a*) + a*c(a*) = g(a) = 0, 


a(a>) + awb(a?) + aw*c(a>) = g(aw) = 0 


and 


a(a?) + aw b(a>) + a wc(a?) = g (aw) = 0. 


Since the three equalities above for a Vandermond system of equations in a(a), 
b(a>) and c(a?), we conclude from Proposition 18.6 (or by direct computation) 
that a(a?) = b(a3) = c(a3) = 0. Thus, letting p denote the minimal polynomial 
of «3, it follows from Proposition 20.3 that p divides a, b and c in Q[X] and, 
hence (again from Problem 4, page 386), in Z[X]. It follows from (20.2) that 
D(X 3) divides g(X). However, since g is monic and irreducible, we conclude 
that g(X) = p(X). 

Now, with g being a polynomial in X°, it follows once more from (20.2) that 
b,c = 0 and, hence, that 


F(X?) = F(X) = g(X)h(X) = a(X*)h(X). 


Such an equality, in turn, implies that h must also be a polynomial in X?, say 
h(X) = 1(X3), for some / € Z[X]. 
Finally, we have f (X*) = a(X3)I(X?) and, hence, f = gl, with dg, d/ > 1. But 
this contradicts the irreducibility of /. 

(ii) h(aw) = 0: as in item (i), we conclude that h(aw*) = 0. Let, also as above, 


h(X) = a(X3) + Xb(X3) + X7e(X3), 
with a, b,c € Z[X]. Then, 
a(a>) + awb(a>) + a7w*c(a?) = h(aw) = 0 
and 


a(a>) + aw*b(a*) + a wc(a?) = h(aw*) = 0. 


Multiplying the first of the equalities above by w and subtracting the second from 
the result, we obtain 


a(a?) — a?c(a*) = 0. 
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However, since g is the minimal polynomial of a, by invoking Proposition 20.3 
once again we conclude that g divides a(X?) — X2c(X3) in Q[X] and, hence, in 
Z[X] (by Remark 14.13, for g € Z[X] and is monic). 

Substituting X by 0, we conclude that g(0) must divide a(0) in Z. Letting a(0) = 
g(0)m, with m € Z, it comes that 


f 0) = g(O)h(0) = g(0)a(O) = g(0)’m. 


Since f (0) is square-free, it follows that g(0) = +1. Hence, letting z;,..., zz be 
the complex roots of g, the relations of Girard-Viéte give 


1 = |g(O)| = Izil.-- Ize. (20.3) 
On the other hand, since 
f@) =F E}) ae phe)=9, 


hypothesis (b) gives Iz} I > 1, so that |z;| => 1 for 1 < j < k. In turn, such 


an information, together with (20.3), allows us to conclude that |z;| = 1 for 
1 < j < k. In particular, |w| = 1 and, being a real number, we must have 
a = +1. Thus, w? = +1 and, since f is irreducible over Q with f(a?) = 0, we 
should have f(X) = X + 1, acontradiction to the fact that df > 1. Oo 


The rest of this section is devoted to showing that the set of algebraic numbers is a 
subfield of C, i.e., that it is a nonempty subset of C which is closed for the operations 
of addition, subtraction, multiplication and division (by a nonzero algebraic number, 
of course). We follow [7], and start with the following result. 


Theorem 20.9 Ifa, 8 € C \ {0} are algebraic, then so isa + B. 


Proof Leta = a1,..., Qn be the complex roots of py and 6 = f,,..., By, be those 
of pg, so that 


Pa(X) = | [(X — aj) and pg(X) = | [(X - 8;). 
j=l 


i=1 


Define 
f(% X1,..-,Xm) =] [x - 6)) = [ ] pp - Xd 
i=) j=1 i=1 (20.4) 


mn—-1 


=X 4 fic(X1, 0. Xm) X*, 
k=0 


for certain polynomials fo, ..., fimn—1 € QLX1, ..., Xm] (since pg € Q[X)). 
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If o is a permutation of J,,, then 


m m 
f(X, Xeqy,--+sXom)) =| [ ps - Xow) =| | pa X - Xd) 


i=l i=l 


= f(X, X1,...,Xm) 


so that (20.4) furnishes 
mn—-1 mn—1 
xm ~~ fi nines Rotax = xm > Fe Kay occ XX. 
k=0 k=0 


Therefore, we have 

Sk (Xo), sees Xo(m)) = A(X, .o+, Xm) 
for all 0 < k < mn — 1 ando, and we conclude that fj, is a symmetric polynomial 
in Xj,..., Xm, forO<k<mn-—1. 


Newton’s Theorem 16.12 then assures the existence of polynomials gx € 
Q[X), ..., Xm] for which 


Sk(X1, woe, Xm) = 8k(S1,~--5 Sm); 


with s1,...,5m € Q[X1,..., Xm] standing for the elementary symmetric polyno- 
mials in X1,..., Xm. In particular, 


Sk (1, «++, 1m) = Bk(si(@i),---,5m(ai)) € Q, 
for 
Pa(X) =X" — sy(a:)X"—! + +--+ (-1)"5m(@i) € QLX]. 
Thus, if h(X) = f(X, a1, ..., @m) then, on the one hand, 


mn—1 


h(X)=X"™ + J filo, ...,0m)X* € QUX] 
k=0 


while, on the other, 


h(X) =|] pp(X - a) =| [] [% - a - B)). 


i=l i=] j=1 
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Therefore, / is a nonzero polynomial with rational coefficients and such that h(a@ + 
B) = 0, which guarantees that a + £ is algebraic. Oo 


Remark 20.10 Suppose that pa, pp € Z[X]. Then, in the notations of the above 
proof, we have sj(@j), .-., Sn(ai) € Zand fe € Z[X1,...,Xm] forO < k < 
mn — |. Therefore, once more from Newton’s theorem, g; € Z[X1,..., Xm] and, 
hence, 


Sk (a, ee, Am) = 8k (S1 (i), ++, 5Sm(@j)) € Z 


for 0 < k < mn — 1. We thus conclude that h € Z[X], and Problem 4 shows that 
Pa+p € Z[X]. 
In order to show that the set of algebraic numbers is closed for products and 


quotients, we need the following particular case of Problem 4, page 416, for which 
we present a simpler proof. 


Lemma 20.11 [fa ¥ 0 is algebraic, then so are a~' and a. 


Proof Let f(X) = ay, X" + an—1X"~! +---+a,X +9 € Q[X] \ Q be such that 
ay # Oand f(a) = 0. Then, g(X) = aoX" +a,X""!4---+an_1X +a isa 
nonconstant polynomial with rational coefficients, and it is immediate to check that 
g(a!) =0. 

For a”, notice that there exist u, v € Q[X], at least one of which nonzero and 
such that 


f (X) = u(X?) + Xv(X*). 
Letting h(X) = u(X)* — Xv(X)*, we have h € Q[X] \ {0} and 
h(a”) = u(a)? — a? v(a7)? 
= (u(a*) — av(a’))(u(a) + av(a’)) 


= (u(a?) — av(a’)) f (w) = 0. 


Theorem 20.12 Ifa, 6B € C \ {0} are algebraic, then so are aB and e 


Proof By the previous lemma, a and f? are algebraic. However, since we already 
know that a + 6 is algebraic, it follows once more from the previous lemma that 
(a + By is algebraic too. Now, since 


1 


ap = 5((a+ py’ — a? — B*), 


two applications of Theorem 20.9, together with the result of Problem 1, assure that 
af is also algebraic. 
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For what is left to do, note that 3 =a: a 
lemma. Hence, the first part above guarantees that Z is algebraic. oO 


with 3 being algebraic by the previous 


Problems: Sect. 20.1 


1. * If r is a nonzero rational and a € C \ {0} is algebraic, prove directly (i.e., 
without resorting to Theorem 20.12) that ra is also algebraic. 

2. Given k,n €N, prove that cos oa and sin 2kn are algebraic. 

3. Let p be a prime number and n € N. Prove that the minimal polynomial of 7/p 
is f(X) = X” — p. 

4. * Leta € C be algebraic. If there exists f € Z[X]\ Z monic and such that 
f(a) = 0, prove that py € Z[X]. 

5. Let aj, a2, ..., a be natural numbers, and a = ,/aj + ,/az+---+./dn. Prove 
that: 


(a) @ is a root of a monic polynomial with integer coefficients. 
(b) Ifa ¢ Z, then a is irrational. 


6. (Brazil) Prove that the polynomial f(X) = X° — X4 — 4X3 + 4X2 + 2 does 
not admit any roots of the form 2/r, withr € Qandn €N,n > 1. 

7. Give a proof analogous to that of Theorem 20.9 to show that a6 is algebraic 
whenever so are a and f. 

8. Leta € C be algebraic, with dp, =n, and define 


Qla) = {ay + aya +++» + ay—1a"~!; ag, a1, ...,dn—1 € QO 
={f(a); f €Q{X], with f =0or df <n-]}. 
The purpose of this problem is to show that Q(@) is a subfield of C.? To this 
end, do the following items: 


(a) Show that Q(q@) is closed for addition, subtraction and multiplication. 

(b) Given B € Q(q@), show that there exists a single f € Q[X] such that 
B= f(a), with f =Oordf <n-1. 

(c) For B = f(a) € Q(@) \ {0}, with f € QLX] such that df <n — 1, show 
that gcd(f, py) = 1. Then, conclude that 7 € Q(a@). 


9. Given a, b,c € Qsuch thata+b/2+cV/4 0, show that there exist x, y, z € 
Q for which 
1 
a+ bV2+c/4 
Then, find x, y and zifa=b=1,c=2. 


=x+yV2+ 204. 


?For a converse, see Problem 4, page 503. 
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10. Let a € C be algebraic, with dp, = n, and Q(q@) be as in Problem 8. Find 
all functions ¢ : Q(a) — C satisfying the following conditions, for all u,v € 


Q(@): 


GQ) Gut v) = ol) + ov). 
(i) (uv) = d(u)g(v). 


20.2 Cyclotomic Polynomials 


The theory of polynomials over Zp, p prime, allows us to present some of the most 
elementary properties of the so-called cyclotomic polynomials; in particular, we will 
show that such polynomials are precisely the minimal polynomials of the complex 
roots of unity. As a byproduct of our study, we will use cyclotomic polynomials to 
prove a particular case of Dirichlet’s theorem on primes in arithmetic progressions. 

Given n € N, recall from Definition 13.23 that the primitive n-th roots of unity 
are the complex numbers of the form a, with wy, = cis = and 1 < k <n being 
relatively prime with n. In particular, there are exactly y(n) primitive n-th roots of 
unity, where g : N — N stands for the Euler function. Given m,n € N, whenever 
there is no danger of confusion we shall write simply (m, n) to denote the gcd of m 
and n. 


Definition 20.13 For n € N, the n-th cyclotomic polynomial is the polynomial 
©,(X)= [[ (Xo). (20.5) 


l<k<n 


(k,n)=1 
It follows from the above definition that ®, is monic with degree 0®, = g(n). 
The coming proposition collects other elementary properties of ®,,. 


Proposition 20.14 Forn € N, we have: 


(a) X"-1l= To<ain ®7(X). 
(b) ®, € ZX]. 
(c) ®,(0) = 1forn > 1. 


Proof 
(a) First of all, we have 


[T] eco = [] ams] [] «-4%,) 


O<d|n O<d|n O<d|n 1<k<n/d 
(k,n/d)=1 


= [[ [] -o%®. 


O<d|n_ 1<k<n/d 
(k,n/d)=1 
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(b) 
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Now, note that each integer 1 < m < n can be uniquely written as m = dk, 
with d, k € N such that 0 < d | n and (k, *) = | (dis exactly d = gcd(m, n)). 
Therefore, the last sum above is clearly equal to 


n 
[[@& - on) =x" = 1. 


j=l 


Let us make induction onn € N, beginning with ®|(X) = X—1 € Z[X]. Given 
a natural number n > 1, assume, by induction hypothesis, that ®,, € Z[X] for 
every integer 1 < m <n. Then, if 


g(X) = [] a(x), 

l<d<n 

d|n 
we have g € Z[X] and, by (a), X” — 1 = ©®,(X)g(X). Since g is monic 
(for we already know that each ®,, is monic), Remark 14.13 guarantees that 
®, € Z[X]. 
For n = 2 this is a direct computation; for n > 2, apply the result of Problem 3, 
page 345. Alternatively, arguing once more by induction, start by noticing that 


X? — 1 = © (X)@2(X) = (X — 1)0(X); 
hence, ®2(X) = X + 1 and ®2(0) = 1. Let n > 2 and suppose, as induction 


hypothesis, that ®,,(0) = 1 for every integer 2 < m < n. Then, in the notations 
of the proof of (b), we have 


(0) = (0) [] 40) = (-1) [] ea) =-1, 


l<d<n l<d<n 


d|n d\n 
and it follows from X” — 1 = ®,(X)g(X) that 
—1= ©,(0)g(0) = —®, (0), 


as wished. oO 


Corollary 20.15 If p € Z is prime, then 


®,(X)=XP1E XP AL. XH1. 


Proof Item (a) of the previous proposition gives 


X? — 1 = 0) (X)®)(X) = (X — 1), (X), 
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so that 
@,(X)= XP 14 KP 24... 4 K4 1, 


oO 


Example 19.28 has shown that ®, is irreducible over Q, so that ®, is the 
minimal polynomial of w,. Theorem 20.18 generalizes this fact by proving that, 
for every n € N, the minimal polynomial of w, coincides with the n-th cyclotomic 
polynomial ®,,. We first need to establish a simple auxiliary result. 


Lemma 20.16 Let f, g € Z[X] and p € Z be a prime number. If g € Zp[X]\ Zp 
and 3” | f inZ,[X], then g | f! inZ,[X]. 


Proof If h € Z[X] is such that f = eh in Zp[X], we know that there exists a 
polynomial / € Z[X] such that 
F(X) = g(X)Ph(X) + pl(X) 
in Z[X]. Computing derivatives at both sides of this equality, we obtain 
f'(X) = 2g(X)g"(X)h(X) + g(X)Ph'(X) + pl'(X) 
in Z[X], and hence 
F(X) = B(X)(28’(X)A(X) + BOON (X)) 


in Z,[X]. Therefore, g | f’ in Z,[X]. g 


For our next result, recall that if w is an n-th root of unity, then Proposition 20.3 
guarantees that its minimal polynomial p,, divides X” — 1 in Q[X]. Then, Problem 4, 
page 486, assures that p, € Z[X]. 


Proposition 20.17 Letn, p € N be such that p is prime and p { n. If w is an n-th 
root of unity, then Py(X) = Pwr (X). 


Proof Let ¢ = w?. Since both w and ¢ are roots of X” — 1, item (b) of 
Proposition 20.3 shows that both p,, and p; divide X” —1. By contradiction, assume 
that p,, # pr. Then, the irreducibility of these polynomials assures, via Gauss’ 
Theorem 19.14, that p, pe divides X” — 1 in Z[X], say 


X" — 1 = po(X) pe (X)u(X) (20.6) 


for some u € Z[X]. 
If g(X) = pe (X?), then 


g(w) = pe(@?) = pr () =0 
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so that (once more by Proposition 20.3) p, divides g in Z[X]. Let v € Z[X] be such 
that pjv = g. In Z,[X], Problem 5, page 469 gives 


Pe(X)0(X) = B(X) = P(X") = (PB, (X))”, 


and Theorem 19.19 guarantees the existence of a monic and irreducible polynomial 
he Zp[X] such that h | Dos P¢ in Z)[X]. It follows from (20.6) that h(x)? | 
(X" — 1) in Zp[X], and the previous lemma gives that h(X) | 7X"~! in Zp[X]. 
However, since h is monic and n ¥ 0, by applying once again Theorem 19.19 we 
obtain 1 < 1 < n — 1 such that h(X) = X! in Z,[X]. Hence, h(X) | (X” — 1) in 
Zp[X], which is a contradiction. oO 


We can finally state and prove the desired result. 


Theorem 20.18 Jf @, = cis 2H, then Po, = Pn. In particular, Py, € Z[X] is 
irreducible in Q[X]. 


Proof Take k € N such that k > 1 and gced(k,n) = 1, and letk = p,... py, 
with p,,..., pj being primes not dividing n. Repeated applications of the previous 
proposition give us 


= a Pipp2 ee = Py-P] == 7 
Pon Pohl Pipl 4 Pop! : Pot 


In particular, the p(n) complex numbers ok, with | < k <n and gcd(k, n) = 1, are 
distinct roots of pw, so that 


IPon > y(n) = 09,. 


However, since ®,, is monic, has integer (thus rational) coefficients and wy, as a root, 
the definition of minimal polynomial assures that py, = Pn. Oo 


We finish this section by proving a particular case of the famous Dirichlet’s 
theorem on primes in arithmetic progressions. This theorem states that an infinite 
nonconstant arithmetic progression of natural numbers contains infinitely many 
prime numbers, provided its first term and common difference are relatively prime. 
Although Dirichlet’s theorem is a natural generalization of Euclid’s Theorem 6.38, 
its known proofs are far beyond the scope of most undergraduate curricula in 
Mathematics. Nevertheless, the fragments of the theory of cyclotomic polynomials 
developed so far allow us to present an elementary proof of Dirichlet’s theorem, in 
the particular case of an arithmetic progression whose first term is equal to 1. 


Theorem 20.19 (Dirichlet) [fn € N, then the arithmetic progression 1, 1+n, 1+ 
2n,... contains infinitely many primes. 


Proof Let pj,..., py be any primes and ®, denote the n-th cyclotomic poly- 
nomial. Since ®, is monic, choosing a sufficiently large integer y we have 
®,(ynp,... px) > 1. Therefore, letting a = ynp, ... pr, we have modulo a that 


®, (a) = ®,(0) = 1 (moda). 
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Hence, ®,(a) = aq +1 = ynp...pxrq + 1, for some gq € N. If p is a prime 
factor of ®,(a), then p ~ pj,..., px and gcd(p,n) = 1. Therefore, if we prove 
that p = 1 (modz), it will follow that p is a prime in our arithmetic progression, 
distinct from pj,..., Pk. 

For what is left to do, note that gcd(p,a) = 1, and let t := ord p(a). Since 
P| ®,(a) and ®, (a) | (a” — 1) (for ®, (X) | (X” — 1) in Z[X]), we conclude that 
a” = 1 (mod p); thus, item (c) of Proposition 12.2 shows that t | n. If we prove 
that t = n then, again from item (c) of Proposition 12.2, together with a?~! = 
1 (mod p), we obtain n | (p — 1) or, which is the same, p = | (modn). 

For c € {a,a + p}, it follows from a‘ = 1 (mod p) that c’ = 1 (mod p). By the 
sake of contradiction, assume that t < n. Proposition 20.14, together with the fact 
that ¢ | n, furnishes 


c"-1= [] w=, [] uo 


O<d|n O<d<n 
d\n 


= ,(c)h(c) |] bac) 


O<d|t 


= ,(c)h(c)(c' — 1), 


for some appropriate polynomial h € Z[X]. However, since c = a (mod p), item 
(c) of Proposition 10.6 gives 


Pn (c) = Pp (a) = 0 (mod p), 
so that 
c" —1= ©, (c)h(c)(c! — 1) = 0 (mod p’). 
On the other hand, 
n—-1 
(a+ py" -—1l=a"—-1+4+ > (“Jan-tp! 
j=l 


and (by the computations above) both (a + p)” — 1 and a” — 1 are multiples of p?. 
Thus, looking at the last equality above modulo p”, we conclude that 


na"—' p = 0(mod p’), 


which is a contradiction. oO 


For a self-contained discussion of the general case of Dirichlet’s theorem, we 
refer the interested reader to [27] or [36]. As far as the special case above is 
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concerned, the coming example shows a nontrivial application. At this point, the 
reader may find it helpful to recall the material of Sects. 12.2 and 12.3. 


Example 20.20 For each prime number p, let g(p) € N denote the least positive 
primitive root modulo p. Then, the function p+ g(p) is unbounded. 


Proof Recall that if a is a primitive root modulo p then ar = —1 (mod p), and 
Euler’s criterion (cf. Proposition 12.19) assures that a is a quadratic nonresidue 
modulo p. In turn, given a natural number n > 2, we need to show that there exists 
a prime number p such that none of 1, 2, ..., is a primitive root modulo p. Then, 
it suffices to find p such that all of 1, 2, ..., are quadratic residues modulo p. 

We first claim that it is enough that every prime g satisfying | < q <nisa 
quadratic residue modulo p. Indeed, if this is so and 1 < m < n is not a perfect 
square, then m can be written as m = kay ... gi, With qi,...,q being pairwise 
distinct prime numbers. Letting g; = ai (mod p), we get 


m=ih’q...q. =k at...a? = (kay...aj)* (mod p). 


Now, Dirichlet’s theorem assures the existence of a prime number p of the form 
p = 1+ 2tn!, for some t € N. We claim that such a p works for us. Indeed, if 
1 <q < nis prime, then the Quadratic Reciprocity Law gives 


(2) (4) = ne) = i(@) =A 


for n! is even. However, since g | n!, we have p = | (mod q), and the properties of 
Legendre’s symbol (cf. item (a) of Proposition 12.23) give 


(i> 


Hence, (4) = |, and g is a quadratic residue modulo p. oO 


Problems: Sect. 20.2 


. Let p, k € N be given, with p being prime. Compute ® ,« explicitly. 

. Ifm > 1 is an even integer, show that ®2,(X) = ®2,(—X). 

3. If m and n are distinct naturals, prove that ®,, and ®, have no nonconstant 
common factors in C[X]. In particular, ®,, 4 ®y. 

4. Letn > 1 be a natural number and d be the product of the distinct prime factors 

of n. Show that ®,(X) = ®g (X"/“). 


5. (England) The set {1 oes “ai contains several arithmetic progressions. 


Noe 


V99 Be Ae 
Given an integer k > 2, prove that it contains an arithmetic progression of k 
terms which is not contained in any arithmetic progression of k + 1 terms of the 
same set. 
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6. Leta,n € N, witha > 1 and jn odd. Prove that the algebraic congruence x” = 
a (mod p) has a solution for infinitely many primes p. 

7. Let a be a natural number which is not a perfect square. Prove that there 
are infinitely many prime numbers p for which a is a non-quadratic residue 
modulo p. 

8. Let a,b € Z be such that for each n € N there exists c € Z for which n | 
(c? +.ac + b). Prove that the equation x? + ax + b = 0 has integer roots. 


20.3 Algebraic Numbers Over Z, 


This section is somewhat more abstract than the previous ones, for we extend the 
concept of algebraic number to consider algebraic numbers over Zp», for some prime 
number p. However, the payoff will be worth the effort, for, given n € N, we will 
be able to compute the exact number of irreducible polynomials over Z, and having 
degree n. 

We depart from a naive though profitable idea, namely, that there exists a number 
set 2p containing Z, that plays, for Z,, the same role as C plays for Q. We start by 
formalizing the concept of field. 


Definition 20.21 A field is a nonempty set K, furnished with operations +, - : K x 
K — K having the following properties: 


(a) + and - are commutative and associative, and - is distributive with respect to +. 

(b) There exist elements 0,1 € K, with 0 4 1, such thata +0 =a anda-1=a, 
for every a € K. 

(c) For every a € K, there exists an element —a € K such that a + (—a) = 0. 

(d) For every a € K \ {0}, there exists an element a~! € K such that a- a7! = 1. 


The reader has certainly realized what we mean by commutative, associative and 
distributive from his/her previous experience. Nevertheless, let us explain all that 
from first principles. Commutativity in item (a) means that 


a+b=b+a anda-b=b.-a, 
whereas associativity stands for 
(a+b)+c=a+(b+c) and (a-b)-c=a-(b-c), 


for all a, b,c € K. In turn, the distributivity of - with respect to + is exactly what 
one expects: 


a-(b+c)=a-b+a-c, 


with the right hand side being a shorthand for the more precise (though somewhat 
cumbersome) expression (a - b) + (a-c). 
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Thus, we surely have that Q, R and C are fields, and Problem 8, page 486 shows 
that so is Q(a), for every a € C algebraic over C. Moreover, in all of these cases, the 
elements 0 and | of item (b) in the above definition are the usual complex numbers 
0 and 1, and this is the reason why, for a general field K, we also denote them by 
0 and 1. A similar remark holds for —a and a™!, i.e., we are simply adopting the 
same notation we use for the additive (resp. multiplicative) inverses of elements of 
C (resp. of C \ {0}). 

Another class of examples of fields we have been dealing with is that of the finite 
fields Zp, with p € Z prime. In this case, however, we shall stick to the usage of 
writing 0 and 1 whenever convenient, in order to avoid any possibility of confusion 
with 0, 1 € Z. 

Back to a general field K, the cancellation laws for addition and multiplication 
hold: 


a+c=b+c>a=b anda-c=b-c,c40>a=b. 
Indeed, 


atc=b+c 3 (at+c)4+(-c) = (b+ 0c) 4+ (-c) 
>at+(ct+(-c)) =b+(c+(-c)) 
>a+0=b+05a=b5, 


and likewise for the multiplication (see Problem 1). 

As it happens within C, whenever there is no danger of confusion we shall write 
ab, instead of a - b, to denote the product of elements a and b of a general field K. 

We could have developed most of the theory of polynomials by considering the 
set K[X] of polynomials over (or with coefficients in) an arbitrary field K, with 
operations +,- : K[X] x K[X] — K[X] extending those of K. Taking for granted 
the (harmless) assumption that we have done that, we now have at our disposal the 
following concepts and facts, whose validities the reader can easily check: 


1. If f, g € K[X] are such that fg = 0 (the identically zero polynomial), then 
f =0org=0. 

2. The notions of degree (for nonzero polynomials) and roots for polynomials over 
K remain true, unchanged. Likewise, d(fg) = df + dg if f,g # O and o(f + 
g) < max{df, dg} if f +g #0. 

3. The division algorithm, the root test and Lagrange’s theorem on the number 
of distinct roots of a nonzero polynomial also continue to hold, with identical 
proofs. 

4. The concept of greatest common divisor for nonzero polynomials over K is a 
direct extension of that for polynomials over Q, and Bézout’s Theorem 19.2 is 
also true, with exactly the same proof. 
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5. Another concept that extends in a likewise manner from Q[X] is that of 
irreducible polynomial f e€ K[X]. Now, note that we have a version of 
Theorem 19.9 at our disposal too, again with the same proof. 


A major gap on extending the theory for polynomials over arbitrary fields is 
fulfilled by the coming result, which will be assumed without proof (we refer the 
interested reader to [6] or [20]). 


Theorem 20.22 Given an arbitrary field K, there exists another field Q containing 
K, whose operations extend those of IK and such that every f € Q(X] has at least 
one root in &2. 


The attentive reader has certainly noticed that Q plays the role of C for K. We 
refer to the property of Theorem 20.22 by saying that Q is an algebraically closed 
field containing K. Also as with C, one now proves that if f € K[X]\{0} has degree 
n, then there exists a € K (the leading coefficient of f) and a,...,@, € & such 
that 


f(X) = a(X — a1)(X — a2)... (X — an). 


The equality above is the factorised form of f over Q. 

We can now define, exactly as was done in Sect. 20.1, what one means for an 
element aw € Q to be algebraic over K, and consider its minimal polynomial py € 
KK[X] \ {0} as was done for a € C algebraic over Q. This way, Proposition 20.3 and 
Corollary 20.4 remain true, unchanged. 

We now restrict our attention to K = Z,, and write 2, to denote the field Q of 
Theorem 20.22. We first recall the result of Problem 5, page 469, which we write in 
the following form: 


f(X?) = f(X)?, V f € ZplX]. (20.7) 


We shall also need the following auxiliary result. 
Lemma 20.23 Let f € Qp[X] \ {0} be given. Ifa € Qy is a root of f, then: 


(a) a? is also a root of f. 
(b) There exists a natural number m < of such that a is a root of xP’ =X, 


Proof 


(a) It follows from (20.7) that f(a”) = f(a)? =0" =0. 

(b) Iterating the result of (a), we conclude that a, a”, a?”, ...are roots of f. Since 
it has at most df distinct roots, we conclude that there exist integers 0 < k < 
l < of for which a? = a, Therefore, 
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where we have used the result of Example 19.15 in the last equality above. It 
comes that a?” “ —a@ = 0, and a is aroot of X?” — X, withm =I1—k < of. 
| 


We are now in position to prove the following 


Proposition 20.24 Let p be prime and a € 2p be algebraic over Zp. If dpa =n, 
then: 


(a) a isa root of XP" — X. 
(b) a is not a root of X®" — X, for any positive integer m <n. 


Proof We already know, from the previous lemma, that @ is a root of X P™ _ X for 
some positive integer m <n. Now, let 


n times 


be defined by 


DG, 41,...,Gn—1) = Ao + ae +---+G,-10"!. 


We claim that ® is injective and each 6 € Im(®) is a root of X ee Indeed, 


if 

Dao, a1, callie. Gn-1) = O(bo, by, a eg bei) 
for distinct n-tuples (dp, @1,...,@n,—1) and (bo, b1,..., bn—1) in the domain of ®, 
then 


(Go — bo) + @ — bya +++» + Gn—1 — bn"! =, 


so that a would be a root of the nonzero polynomial (do — bo) + (@1 —by)X +--+ 
(Gn—1 — bn—1)X"—! of Zp[X]. Since dpq =n, this is a contradiction. 

For the second part, let B = ag + aja +--- + G,—1a"—!. The result of 
Example 19.15, together with Fermat’s little theorem and a?” = a, gives 


m = = 7 _ m 
BP" = (Go+Giat+--++G,—-10""')? 


m 


pin pm m pm = 
= ae a Pe EP DP 


=G+aja+---+a,_1;a"! 


=6 
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Let 7,, stand for the set of roots of X? "_¥X inQ p- The above claims then assure 
that |Rm| = p”, so that p” > p” and, hence, m > n. Since m < n, we then get 
m = n, and items (a) and (b) follow at once. oO 


A direct consequence of this proposition is the coming 


Corollary 20.25 Let p be prime and a € Qy be algebraic over Zp. If Ipa = n, 
then Pw | (xP" _ X) in Zp[X]. 


Proof This follows from the previous result, together with the analogue of Proposi- 
tion 20.3 in our setting. Oo 


Another consequence is collected as the next result. 


Lemma 20.26 Let f € Zp[X]\Zp be irreducible and of degree d. If f | (X?" —X), 
then d | n. 


Proof Ifa € Qy, is aroot of f, then f = py, and the previous corollary guarantees 


that f | (xe — X) and f { (xP — X), for every positive integer k < d. Since 
f | (XP" — X), we conclude that d < n. 
Now, let n = d + t and write 


at a eS ak aa x 
SF aay ee 
It readily follows from this equality that 
ged (X”" — X, y= X) = ged (X” — x, ie x). 


Assume that n = dq +r, with0O <r < d. Iterating the gcd equality above, we 
get 


gcd (X?" — x, xP" — xX) = ged (x? — x, x?" — X). (20.8) 


Since f divides the left hand side, it also divides the right hand side. In particular, 
f |x P" _ X), which is a contradiction. oO 


We can finally state and prove our main result, for which we let 
dy = #{monic, pairwise distinct irreducible polynomials of degree n in Zp[X]}. 


Also, if dn > 0, we write fni, fn2,---» fna, to denote such polynomials. 


Theorem 20.27 Let p be prime andn € N. Then, 


xP —X= |] far(X)..- faag(X), (20.9) 


O<d|n 


with the product fa)... faag taken as | ifag = 0. 
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Proof The analogue of Theorem 19.9 for Zp[X] assures that X b" _ X is the product 
of finitely many irreducible polynomials, which can all be assumed to be monic. 

If f is one such polynomial, with 0f = d, the previous lemma shows that d | n, 
so that f is one of the polynomials in the right hand side of (20.9). Conversely, if 0 < 
d|n,1< j < aq anda € Q, is aroot of fy;, then fy; = pa, and Corollary 20.25 
guarantees that f4; | (x Pa ) in Z,[X]. However, since d | n, the argument in the 


proof of the previous lemma leading to (20.8) shows that (x Ps. x ) | (x PX ) 


in Zp[X]. Therefore, fa; | (X?" — X) in Z,[X]. Oo 
Corollary 20.28 If p is prime andn €N, then 
ice y~ u(“)p4 > 0 (20.10) 
nA n d ’ . 
O<d\|n 


where ju is the Mobius function. 
Proof Computing degrees in (20.9), we obtain 
p" = dag. 
O<d|n 


Applying Mobius inversion formula (8.4), we obtain the given expression for ay. For 
what is left to do, just note that, since Im() C {—1, 0, 1}, we get 


Ndn = p™+ + H(5)p" = p" — > pé 
O0<d<n O<d<n 


d\n d\n 


n—-1 


p' =p 
2p =) pap eu 
d=1 P 
oO 


Corollary 20.29 Let p be prime andn € N. If0 < d | n, then XP" — X has an 
irreducible factor of degree d. 


Proof This follows from (20.9), together with the previous corollary. oO 


Problems: Sect. 20.3 


1. * Prove the cancellation law for the multiplication in a field. 

2. Let p be prime and, for n € N, let R,, denote the set of roots of X o = ¥ in p 
Show that R,, is a subfield of Q, containing Zp. 

3. In the notations of the statement of Theorem 20.27, show that az = (7) and 


Pid a. Pg =O? — KP" + 1. 


4. Let K be any field. Prove that K[X] has infinitely many irreducible polynomials. 
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As was said at the beginning of Sect. 20.1, transcendental numbers are precisely 
the complex numbers which are not roots of nonzero polynomials with rational 
coefficients. Nevertheless, up to this point we don’t even know whether such 
numbers do exist. 

One possible way of establishing the existence of (real) transcendental numbers 
is to start by showing that the set A of (real) algebraic numbers is countably 
infinite, i.e., that its elements can be put into a sequence (%,)n>1, so that A = 
{x1, x2, x3, ...}; then, one proceeds to showing that R is uncountable, so that R \ A 
is necessarily nonempty. Up to standard facts on countable and uncountable sets, we 
follow this path in the coming theorem, referring the reader to [8] or [33] for the 
necessary background. 


Theorem 20.30 The set of real algebraic numbers is countably infinite, whereas 
that of real transcendental numbers is uncountable. 


Proof Let A, denote the set of real numbers which are roots of nonconstant 
polynomials with rational coefficients and degree n. If we show that each A, is 
countably infinite, so will be the union 


A=) An 
n>1 


However, such a union is precisely the set of real algebraic numbers. 
For what is left to do, for each n € N let P,, denote the set of polynomials with 
rational coefficients and degree n. Since the function 


F:@ xQx---xQ— Pr@Q 
—— 
n=1 
given by 
F (Gn, Qn—1, +++, 40) = GnX" + Gn_1X" 1 +--+. + aX +a 
is a bijection with countably infinite domain, we conclude that P,, is countably 


infinite. Now, let ¢ denote the set of real roots of f € Q[X] \ {0}. Since each 
Ff € Py has a finite number of real roots, it follows that 


An= (J Re 
fEPn 


is a countably infinite union of finite sets, hence countably infinite (for it con- 


tains Q). 


Finally, since R is uncountable and A C R is countably infinite, we conclude 
that R \ A is uncountable. o 
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We now present the classical, explicit construction of the first transcendental 
number, due to Liouville. Up to details, our discussion follows the marvelous [12]. 

In the following, we say that an algebraic number a has degree n if its minimal 
polynomial p, has degree n; in particular, if > 1, then it’s clear that a is irrational. 
We shall also need another piece of terminology: we say that a property P(k), 
depending on a natural number k, is true for all sufficiently large k if there exists 
ko € N such that P(k) is true whenever k > ko; moreover, if a specific value for 
ko is irrelevant in the context under discussion, we only say that P(k) is true for all 
k> 1. 

We are finally in position to state and prove Liouville’s theorem. For the sake of 
comparison, at this point you might want to take a look at the statement of Dirichlet’s 
Lemma 7.7. 


Theorem 20.31 (Liouville) Leta € C be an algebraic number of degree n > 1. If 
(Pk )k>1 and (qx)k>1 are sequences of nonzero integers such that limg— +00 Pk — q, 


q 
then ‘ 
1 
fos, (20.11) 
qk Uk 
forallk > 1. 
Proof Let a, = 2. Since az my a, it follows that gx oe +oo. Since a is 


algebraic of degree n > 1, there exists f € Z[X] \ {0} of degree n and such that 
f(a) = 0, say, 


f(X) =a, X" +---+a,X +40, 


with ag, a1,...,4n € Z. 
We now observe that 
f (ax) 1 
= “| fax) — f(@)| 
Ak—a Ak —a 
1 n 
= ify j 
= . aj(a, — a 
soa ewilel -«) 
1 n . 
. J j 
= —————* aj|- |@, — a’), 
aa 2! Al log 
J= 


where we used the triangle inequality in the last passage above. Since ax a a, we 
have |a, —a| < 1 forall k >> 1. Under this assumption, we continue to estimate the 
last expression above by using some elementary algebra, together with the triangle 
inequality again, to obtain 
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n 


J j 
BF yi eae! 
lax — a| 


j=l 


f (ax) 


Aka 


n 
< ola j|((lax—eel +l)! + (lore = 1+ eel) el +---+la\4) 
j=l 


n 
=y laj|((1 + jal) Pace ja!) a eg sclle ja\/") 
j=l 


n 
< > jlajl(1+lal)/' =, 


j=l 


where C depends only on a, and not on k. 
Then, for all k >> 1 (chosen in such a way that |a, — a@| < 1), we have 


= |ax —a| > alflawh 


Now, since gx a +oo, we have jax — a| < | and gq, > C forall k > 1, so that 
‘a > x and, hence, 


Pk 
a 


1 
> —|f(ax)l, 
qk 


for all k > 1. 

As our final step, notice that if f(a,) = 0, then we would have f(X) = (X — 
ax)g(X), for some g € Z[X]\ {0} of degree n—1. Since a # ax (for, a is irrational), 
we conclude that a would be a root of g, which contradicts the fact that the minimal 
polynomial of w has degree n. Therefore, f(a,) 4 0 and, hence, 


n 
k k 
If (ox)| = an (2) er (2) +49 
qk dk 


n—- 


f n 1 n 
= Gilanvh +--+ a1 Peay + agg, | 
k 


1 
“n* 


Tk 


= 


Finally, combining this inequality with the previous one, we get (20.11). oO 
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The coming example brings a clever application of Liouville’s theorem to the 
construction of transcendental numbers. Along its discussion, we shall need some 
acquaintance with the arithmetic of convergent series, for which we refer the reader 
to Section 10.3 of [8]. 


Example 20.32 (Liouville) If a, a2,a3,... € {1,2,3,..., 9}, then the real number 


a a a 
Doh OR ce Mi 


a 10!! 102! 102! 


is transcendental. 


Proof Suppose that a were algebraic, of degree n > 1 (it is clearly irrational—see 
Problem 1). For k > 1, let 


1 
lov a | (10)"*" 
for all k >> 1. On the other, 
_ aj 1 = 1 
i> 
Therefore, for k >> 1, we should have 
1 1 


10@+DE1 ~ (108) 
and, hence, (k + 1)! — 1 < k!(n + 1). However, this last inequality is equivalent to 
k\(k —n) <1, 


which is false for all k > n+ 1. 
Finally, since the assumption that a is algebraic leads to a contradiction, we have 
no alternative but to conclude that a is transcendental. oO 


We finish this section by observing that the methods of real Analysis allow one 
to prove that e and z are transcendental numbers. Both these facts were established 
already in the nineteenth century, due to the works of the French mathematician 
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C. Hermite and the German mathematician F. Lindemann, respectively. Proofs are 
beyond the scope of these notes, and we refer the interested reader to [17] or [28]. 


Example 20.33 Once we admit the transcendences of e and 2, Theorems 20.9 
and 20.12 can be applied to establish the transcendence of several other numbers. 
More generally, if « € R is algebraic and # is transcendental, then a + 6 is 
transcendental. Indeed, if it algebraic we would have 


B= (a+ B)—a@, 


also algebraic. A similar argument shows that if a € R \ {0} is algebraic and f is 
transcendental, then wf is transcendental. 


Problems: Sect. 20.4 


. Prove that the number a of Example 20.32 is irrational. 
. Leta € C be algebraic and f € C be transcendental. Prove directly (i.e., without 
resorting to Theorem 20.9) that a + 6 is irrational. 
3. If a € C is transcendental and n € N, prove that %/a@ and a” are also 
transcendental. 
4. * Leta € C be transcendental, and define 


Noe 


Qla] = {ap + aja +--++ ana"; n €N and ao, aj,..., an € Qh. 


Prove that Q[a] is closed with respect to addition, subtraction and multiplication 
but is not a subfield of C. 

For the coming problem you might want to use the following fact, which will 
be stated without proof: if w € C is a root of a polynomial 


F(X) = an X" + ay_1X"1| +--+ +a X +0, 


with ao, a1, ..., Gn algebraic and a, ¥ 0, then a is also algebraic. A proof of this 
fact can be found in any standard text on Field Theory, for instance [6] or [20]. 

5. (OBMU) In the cartesian plane, let S be the set of circles whose centers have 
rational coordinates and whose radii have rational lengths. Show that there exists 
a regular 2016-gon, all of whose vertices do not belong to S. 


Chapter 21 ®) 
Linear Recurrence Relations Cheek for 


In this chapter we complete the work initiated in Section 3.2 of [8] (see also 
Problems 5, page 79, and 6, page 393), showing how to solve a linear recurrence 
relation with constant coefficients and arbitrary order. We first need to properly 
define the objects involved, and we do this now. 


Definition 21.1 A sequence (d,),>1 is said to be linear recurrent if there exist a 
positive integer k and complex numbers uo, ..., Uz—1, not all equal to zero, such 
that 


On+k = Uk—14n4+k—-1 + +++ + U0dn (21.1) 


for every n > 1. 


In the above notations, we sometimes also say that (21.1) is a linear recurrence 
relation; the natural number k is thus said to be the order of the linear recurrence 
relation (21.1). 

Given a1,...,a@% € C, it is immediate to verify (by induction, for instance) 
that there is exactly one linear recurrent sequence (dy),>1 satisfying (21.1) and 
such that a; = a; for 1 < j < k. Hence, a linear recurrent sequence of order k 
falls completely determined upon knowing the linear recurrence relation it satisfies, 
together with the values of its first k terms. 


Definition 21.2 Let (a,),>1 be a linear recurrent sequence such that 

Ontk = Uk—-1Antk-1 +++ + Ugan 
for every n > 1, with uo,...,ug%—1 being given complex numbers, at least one 
of which is nonzero. The characteristic polynomial of (a,,),>1 is the polynomial 


f € C[X] given by 


FCO SX" apg Sa ak = ay: (12) 
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21.1 An Important Particular Case 


In this section, we shall discuss how to solve a linear recurrence relation for which 
the corresponding characteristic polynomial has pairwise distinct complex roots. In 
spite of such a limitation, this case is much simpler to be dealt with than the general 
one and finds several interesting applications. The result we are interested in is the 
content of the coming 


Theorem 21.3 Let (dy)n>1 be a sequence satisfying, for every n > 1, the linear 
recurrence relation 


An+k = Uk-14n4+k-1 + +++ + UOan, 
with ug, ..., Up—1 being given complex numbers, not all equal to zero. If the complex 
roots Z1, Z2, ..., Z% of the characteristic polynomial of (ay)n>1 are pairwise distinct 


anda; =a; for|! < j <k, then 


—1 —1 —1 
Qn =2Z, X1+2, Kat +B x Vn= 1, 


where x1, ..., Xx is the solution of the Vandermonde system of equations 
Xi + XQ + +++ +X = a} 
ZiXy + 22XQ be + LXE = 2 
Ki $+ 25x20 +++ + 27 xK = 03. (21.3) 


-1 -1 k-1 
wy Xie XA +e + Me = OR 
Proof Since z1, Z2,..., Zk are pairwise distinct, Proposition 18.6 assures the exis- 
tence of a unique solution x;, x2,..., x% for the system of equations (21.3). We can 


thus let (b,)n>1 be defined by setting 
b, = gay + za hx feet ade Vn>1. 
This way, for | < j < k, system (21.3) furnishes 
b= ax + za hx, fone a xg ay = ie 


On the other hand, since f is the characteristic polynomial of (a,),>1, it follows 
from the definition of the b;’s that 
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Dn+k — Uk—1Pntk—1 — +++ — Uobn = 
k k k 
= n+k-1, | n+k-—2,.. n-1.. 
= ye tj te Ye, xpos uo) z xj 
j=l j=l j=l 
k 
= n—-ly 7k k-1 
=> xj(Zj Uk-12; uo) 
j= 
k 
n—-1 
=e 0. 
j= 


It follows from the above that (by ),>1 satisfies the same linear recurrence relation 
that (a,)n>1, and its first k terms respectively match the first k terms of (dy)y>1. 
Therefore, a straightforward inductive argument guarantees that a, = b, for every 
n > 1, as we wished to show. oO 


The coming example shows that we not necessarily need to actually know the 
roots of the characteristic polynomial of a linear recurrence relation, in order to 
apply the result of the previous theorem to obtain interesting information concerning 
its behavior. 


Example 21.4 (Crux) Let A; A2A3Aaq be the square of vertices Aj = (0, 1), A2 = 
(1,1), Az = (1,0) and Aq = (0,0). For each integer n > 1, let An+4 be the 
midpoint of the line segment A, Ay+1. Prove that, when n — +00, the sequence of 
points A, converges to some fixed point A, and compute the coordinates of such a 
point. 


Proof For each n > 1, let An(%, yn). The stated condition, together with the 
formula for the coordinates of the midpoint of a line segment (cf. Section 6.2 of 
[9]), gives 


2Xn44 =Xn41 +4Xn Vn >= I, 


an analogous recurrence relation being valid for the sequence (yn)n>1. 
The characteristic polynomial of the above recurrence relation is 


f(X) = 2X*- X -1=(X— )g(X), 


with g(X) = 2X3 + 2X2 42X 4+ 1. Ifa, b andc are the complex roots of g, then 
Girard-Viéte relations give 


a+b+c=-l1 and ab+ac+bc=1. (21.4) 
Hence, 


at+b?+c? =(a+b+c)* —2(ab+ac+ be) = (-1)* —2-1=-1 <0, 
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so that, thanks to Problem 4, page 386, we can assume that a is real and b and c are 
nonreal complex conjugates. In particular, a, b and c are pairwise distinct and, since 
1 is not a root of g, Theorem 21.3 guarantees the existence of complex constants A, 
B,C and D for which 


%, = Aa + Bo +Cer! 4D, Vin= 1. (21.5) 


For us to know what happens when n — +00, note firstly that g(—1)g(0) = 
—1 < 0 and, thus, Bolzano’s Theorem 17.2 assures that —1 < a < 0. Therefore, 
relations (21.4) give 


1=a(b+c)+bce =a(—1—a)+bb, 


so that |b|? = a(a+1)+1 < 1. However, since |b| = |c|, it comes that |b| = |c| < 1. 
Then, relation (21.5), together with the result of Example 21.8, gives 


lim x, =D. 
n—> +00 


In order to compute the value of D, recall that xj = x4 = 0 and x2 = x3 = 1. 
Hence, by successively letting n = 1, 2, 3 and 4 in (21.5), we obtain the linear 
system of equations 


A+B+C+D = 1 
Aa+Bb+Cc+Dd =0 
Aa? + Bb? + Cc? + Dd? =0° 
Aa? + Bb? +Cc3+ Dd =1 


Multiplying the last three equations by 2 and termwise adding the four equalities 
thus obtained, we arrive at 


Ag(a) + Bg(b) + Cg(c) + 7D = 3. 


However, since a, b and c are roots of g, we conclude that 7D = 3, and D = 3. 
In an analogous way, we prove that limp—+60 Vn = ;. Thus, 


. 3 4 
lim A,={-=, =]. 
n—>-+00 Td 


Problems: Sect. 21.1 


1. (OBMU) If f(x) = e~* sinx and f denotes the n-th derivative of f, compute 


fPOY ©). 
2. Let Ai (a1, db), A2(a2, b2),..., Ax (ag, be) be k > 2 given points in the cartesian 
plane. For eachn € N, let Ay+x be the center of mass of An, An+1,.--, An+k—1, 


1.e., 
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1 
An+k = j(Antk=1 a An+i + An). 


If Am(%m, Ym) for m > k, show that 


m_ kag + (k — l)ag-1 + +++ + 2a2 + ay 
kK+(k-M)+---+2+1 


Xm 


and 


m. kby + (k — 1)bg-1 + +++ +2b2 + by 
kK+(k—1)+---+2+41 


Ym 


3. (IMO shortlist) If a is the largest positive root of the polynomial X? — 3X? + 1, 
show that |a!788| and [a!%88| are both multiples of 17. 


21.2 Sequences, Series and Continuity in C 


In this section we partially extend the basic facts on convergence of real sequences 
and series (cf. Chapters 7 and 11 of [8], for instance) to functions with complex 
values. We observe that the material collected here (more precisely, Theorem 21.20) 
was used in the proof of the Fundamental Theorem of Algebra, in Sect. 15.3, and 
will be of fundamental importance for Sect. 21.3. We also show that it can be used 
to extend the method of generating functions to complex power series. 

Given a € Cand R > 0, we denote by D(a; R) the open disk of center a and 
radius R, i.e., the subset of C given by 


D(a; R) = {z € C; |z—a| < R}. 


Analogously, the closed disk of center a and radius R is the subset D(a; R) of C, 
given by 


D(a; R) = {z € C; |z—a| < R}. 


A set U C Cis open if, for every a € U, there exists R > 0 such that D(a; R) C 
U.Aset F C Cis closed if F° = C \ F is open. It is immediate to see that # and 
C are open (in the case of J, there is no way for the openness condition not to be 
satisfied, since there exists no z € Y). Hence, C = C \ and Y = C \ C are also 
closed. We now collect less trivial examples of open and closed sets. 


Example 21.5 For alla € C and R > 0, the open disk D(a; R) is an open set, 
whereas the closed disk D(a; R) is a closed set. 
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Fig. 21.1 Every open disk is - a ra 
an open set & y ™ 
/ 5) 
yee 
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Proof Let z € D(a; R) be arbitrarily chosen and r = R — |z — a|. Then, r > 0 and 
we claim that D(z; r) C D(a; R) (cf. Fig. 21.1), which will be enough to guarantee 
that D(a; R) is open. 

For what is left to do, take w € D(z; r). Triangle inequality gives 


|w—al =|(w—z)+@—a)|S|w—2zl+|z—-al 


A 


r+ |z-al=R, 


so that w € D(a; R). However, since this holds with every w € D(z;7r), we 
conclude that D(z; r) C D(a; R), as wished. 

For the second part, it is enough to show that U = C \ D(a; R) is open. To this 
end, take z € U and letr = |z — a| — R. Then, r > O and, as in the first part, 
we easily show that w € D(z;r) => w € U, which is the same as D(z; r) C U. 
Therefore, U is indeed open. oO 


We now turn our attention to sequences of complex numbers, extending to them 
the concept of convergence. 


Definition 21.6 A sequence (Z,)n>1 in C converges for a limit z € C, and write 
Zn > Z OF liMy_-++00 Zn = Z, provided the following condition is satisfied: 


Ve>0, dnp EN; n> 10> |[Zn -2| <€. 


Heuristically, the fulfilment of the condition above means that, as n —> +00, the 
terms z, come closer and closer to z. Indeed, the definition of convergence stipulates 
that (Zn)n>1 converges to z € C if, for an arbitrarily given radius € > 0, there exists 
an index no € N such that, for n > no, all of the terms z, belong to the open disk 
D(z; €). 

As in the real case, a subsequence (Zn,)x>1 Of a sequence (Zn)n>1 is the 
restriction of (Zn )n>1 to an infinite subset {n1 < nz < n3 < ---} of indices; froma 
rigorous standpoint, (Zn, )x>1 is the sequence f og: N — C, with f : N > C and 
g:N-— N given by f(n) = z, for every n € N and g(k) = nx for every k € N. 

The coming result brings two fundamental properties of the concept of conver- 
gence of sequences of complex numbers. In words, item (a) assures that the terms 
of a convergent sequence cannot have two distinct limits, and item (b) that every 
subsequence of a convergent sequence is also convergent and converges to the same 
limit. 


21.2 Sequences, Series and Continuity in C S11 


Lemma 21.7 Let (Zn)n>1 be a sequence in C. 


(a) If Zn > zand Zz, —> w, thenz = w. 
(b) If (Zn )k>1 is a subsequence of (Zn)n>1 and Zn — Z, then Zn, — Z. 


Proof 


(a) If z # w, then e = |z — w| > 0. However, since § is also positive and z, > z 
and z, — w, there exist natural numbers 1; and n2 such that |z, — z| < 5 for 
n > ny and |Z, — w| < 5 for n > nz. Hence, taking an index n > n1, 2 and 
applying triangle inequality, we obtain 


Iz— wl =|(2- Zn) + Zn — w)| 


S |Z —2nl + [Zn — wl 
€ € 


which is a contradiction. Therefore, z = w. 
(b) Given € > 0, the convergence of (Z;)n>1 to z assures the existence of np € N 


such that |z, — z| < € forn > no. Now, since nj < n2 < n3 < ---, there 
exists ko € N with k > kp > ng > no. Hence, for such values of k, we have 
lZn, — Z| < €, So that (Zn, )x>1 also converges to z. oO 


Thanks to item (a) of the previous lemma, if a sequence (z,,),>1 in C converges 
to z € C, we shall hereafter say that z is the limit of (Zn)n>1. 

For the purposes of these notes, one of the most important examples of 
convergent sequence is the one collected below. In this respect, see also Problem 2. 


Example 21.8 If |z| < 1 and z, = z” for every n > 1, then (z,)n>1 converges to 0. 


Proof Firstly, note that |z, — 0| = |z”| = |z|”. Now, if (a,)n>1 is the sequence 
of real numbers given for n > 1 by a, = |z|", then (cf. Example 7.12 of [8], for 
instance) a, — 0. Thus, given € > 0, there exists ng € N such that 0 < a, < ¢ for 
n > no. Thus, forn > no, we have |z, — O| = ay, < €, so that z,, > 0. oO 


Given a sequence (Z,,)n>1 of complex numbers, we can write Z, = X, + iy, for 
every n > 1, with x», y, € R. Our next result relates the convergence of (Zn)n>1 in 
C with those of (xn)n>1 and (¥_)n>1 in R. 


Lemma 21.9 Let (Zn)n>1 be a sequence of complex numbers, with zy = Xn + 1Yn 
and Xn, Yn € R for every n > 1. Then, (Zn)n>1 converges in C if and only if (Xn) n>1 
and (Yn)n>1 converge in R. Moreover, if x, — a and yn — b, then Z, — Z, with 
z=atib. 


Proof Firstly, suppose that z, — z, with z = a+ib,a,b € R, and let e > 0 be 
given. Since 


Xn — |, lyn — b| S lin — al? + |Yn — bi? = |Zn — Zl, 
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we have |x, — a|, |yn — b| < € whenever |z, — z| < €. However, since z, — z, 
there exists ng € N such thatn > no => |Z, — z| < €. Then, for each such natural 
n, we do have |x, — al, |¥n — b| < €, hence establishing the desired convergences. 

Conversely, assume that x, — a and y, — b, and letz =a+ibande > 0 be 
given. Since 


len — 21 = Vln — a? + lyn — BP. < lin — al + ln — BI, 


we have [Zn — z| < € if |xn —al, |Yn — b| < 5. However, since x, — a and yn, > b, 
there exist 1,2 € Nsuchthatn > n; > |x,—a| < 5 andn > n2 => |yn—b| < 5. 
If no = max{n1, n2}, then, forn > no, we have |x, — al, |yn — b| < 5 and, hence, 


|Zn — z| < €. This establishes the convergence of (Z7)n>1 to z. oO 


We now need to extend to complex sequences another concept related to the 
convergence of real sequences. 


Definition 21.10 A Cauchy sequence in C is a sequence (Z,),>1 satisfying the 
following condition: given € > 0, there exists ng € N such that 


m,n > ng => |[Zm — Zn| < €. 


If (Zn)n>1 is a convergent sequence in C, then (Zn)n>1 is a Cauchy sequence. 


Indeed, if z, — zande > Ois given, then there exists no € N such that |z, —z| < 7 
for every n > no. Hence, for natural numbers m,n > no, it follows from the triangle 


inequality for complex numbers that 


€ 
2 


€ 
em — Zn] S l&m — 21+ [2 —Znl S 5 + =€. 


Conversely, we have the following important result. 


Proposition 21.11 C is complete. More precisely, if (Zn)n>1 is a Cauchy sequence 
in C, then (Zn)n>1 Converges. 


Proof Let zy = Xn + iyn, with (%,)n>1 and (yn)n>1 being sequences of real 
numbers. Given « > 0, take no € N as in the definition of Cauchy sequence. 
Since |Xm — Xn| < |Zm — Zn|, it follows that (x,)n>1 is a Cauchy sequence in 
R. Therefore, Theorem 7.27 of [8] guarantees that (x,)n>1 is convergent, say for 
x € R. Analogously, there exists y € R such that y, — y. Hence, letting z = x+iy, 
Lemma 21.9 assures that z, — z. oO 


As in the real case (cf. [8]), given a sequence (Z,),>1 of complex numbers, we 
define the series! ps1 Zk aS the sequence (s,)n>1, such that sy = )-;_, Ze for 
every n € N. Also as in the real case, we say that s, is the n-th partial sum of 


'The reader must pay attention to the fact that we shall sometimes consider sequences (Zn)n>0 of 
complex numbers, so that the corresponding series will be denoted by }°,. Zk. 


21.2 Sequences, Series and Continuity in C 513 


the series, and that the series converges if there exists the limit s := limy_.+o0 Sn; 
moreover, in this case we shall say that s is the sum of the series. In symbols, we 
write 


provided the limit at the right hand side does exist. 
We now present an example of convergent series which will be of fundamental 
importance in the next section. 


Example 21.12 Given a € C \ {0}, show that for z €¢ D(0; a) we have 


Proof Lemma 13.18 assures that the n-th partial sum of the series is 


7 ,  1- (az)! 1 (Cia ale 
)_(a2)" = = Te 1—az 
= az az az 
Now, for z € D(O; uD we have |az| < 1, so that Example 21.8 gives (az)” — Oas 
n — +00. Hence, it follows from the equality above that, for z ¢ D(0; ae 


n 
1 n+l 1 
Y\(az)' = lim SV(az)k = ie a 
n—>+00 l—az n-+o 1l—-az 1l—az 
k>0 k=0 


oO 


Another concept of central importance is that of an absolutely convergent 
series, i.e., a series 7,9 2g of complex numbers such that the real series }°,.9 |Z«| 
converges. Forn € N, let s) = )-7_, Ze and % = > -y_1 |zel, So that (ta)n>1 
converges. The triangle inequality for complex numbers furnishes, for integers 
m>n, 


m m 
Sm — Sn| = > Zk| S a IZk| = tn — tn. 
k=n+1 k=n+1 


Now, since (t,)n>1 converges, it is a Cauchy sequence; hence, given € > 0, there 
exists ky) € N such thatm > n > ko > tm — ty < €. Therefore, we also have 
Sm — Sn| < € form > n > ko, so that (sp)n>0 is a Cauchy sequence too. Thus, 
Proposition 21.11 assures its convergence, and we have proved the coming. 


Proposition 21.13 Jn C, every absolutely convergent series is convergent. 


Before we can go on, we need to extend the concept of continuity for functions 
defined on a subset of C and assuming complex values. 
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Definition 21.14 Given a nonempty subset X of C, we say that a function f : X > 
C is continuous if the following condition is satisfied: for every sequence (Zn)n>1 
of points of X, if z, > z, with z € X, then f(zn) > f(z). 


Our next result will allow us to establish the continuity of complex polynomial 
functions. 


Proposition 21.15 Jf f, g : X — C are continuous functions, then so are f + 
g,fg:X ~C. 


Proof Let a sequence (Z,)n>1 in X be given, such that z, — z, for some z € X. 
Firstly, the triangle inequality for complex numbers gives 


If + 8) Gn) — (F + 8)@) =F Ga) — F@) + En) — 8@I 
SIF Gn) — F@I| + len) — 8@)I- 


Now, given € > 0, since f(z.) > f(z) and g(zn) > g(z), there exists ng €¢ N 
such that 


€ 
n>no = |f Zn) — fl, gn) — g(z)| < 5" 
Therefore, also for n > no, the above estimates show that 


If + 9)Zn) — fF +91 < = + 5 =e 


and, hence, that (f + g)(zn) > (f + g)(2). 
For fg, and applying twice the triangle inequality, we obtain 


I(fs)Zn) — FO! =lfEn)sn) — F)s@| 
<= If Gn) — F@Ils@n)l + |F@Olls Gn) — 8@)| 
<= |f Gn) — F@|lsGn) — 8@)| 
+ |f Ga) — FANE + lF@leG@n) — s@l- 


As before, given € > 0, we can take no € N such that n > ng implies 


If @n) — FI < min /< sere} 


and 


. € = 
jee) gol < min |, 
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Then, for a natural number 1 > no, it follows from the above that 


€ € € 
(fs) Zn) — FOI S fE-fE+ Xle@l+)” Ig (z)| 
€ 


+IF@I- 


3Uf@1+ 1) 


2 oe ese 
3°33” 


so that (fg)(Zn) > (fg)(z). a 
Example 21.16 Given n € N and ao, a,...,4n-1,4) € C, with a, # 0, the 


complex polynomial function f : C > C, given for z € C by 


1 


ff (Z) = nz" +an-12" +++ Faiz +40, 


is continuous. 


Proof Obviously, constant functions and the identity function z +> z are continuous 
(just apply the definition). Thus, by applying several times the second part of the 
previous proposition, we conclude that, for an integer 0 < k < n, the function z b> 
axz* is also continuous. Now, applying several times the first part of the previous 
proposition, we conclude that the sum of such functions, when k varies from 0 to n, 
is also continuous. However, such a sum is precisely the function /. Oo 


We now turn to the discussion of power series in C, namely, a series of the form 
ys k0 axz*, for some given sequence (da, ),>09 of complex numbers. 

Assume that the sequence (X/Jax|)x>0 is bounded, say with /[ax[ < A for 
every k > 0 and some A > 0. Then, lagz*| < (Az)* for every k > 0, so that 
Example 21.12 guarantees that the series )>,.9 |axz*| converges in D(0; R), where 
R= i In turn Proposition 21.13 then assures that the power series Deco az* 
converges in D(O; R). We abbreviate such a situation simply by saying that D(O; R) 
is a convergence disk for the power series under consideration. The next result 
assures that the function f : D(0; R) — C thus defined is continuous.” By the sake 
of notation, we make the convention that }~ K0 % zk = ao for z = 0. 


Proposition 21.17 Jf D(O;R) is a convergence disk for the power series 
re ae then the function f : D(O;R) — C, given for z € D(O; R) by 
f@ = Vieo axzk, is continuous. 


? Actually, one can show that f is an example of a holomorphic (i.e., complex differentiable) 
function. However, since we shall not need this concept, and in order to maintain the elementary 
character of these notes, we will only establish the continuity of functions defined by power series. 
The interested reader can find the relevant results and definitions in [11]. 
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Proof For a fixed z € D(0; R), take w € C such that |w — z| < 5(R — |z|). Then, 
|w| <r, withr = 5(R + |z|) < R, and for k € N we have 


[w* — z*) = |w —2|Jwe 1 + wh2z 4 ---t wk? 4 AO 
k-1 k-2 k-2 k-1 
= wz) (w se wel le | ee [elle lal) 
<jwo geht prt? pees pr? 4p) 


= kr*|w — zI. 


Hence, under the above restriction on w, Problem 4 yields 


fw) = F@ =| Dracw* — Yrauz*] =| Dax cw* - 24 
k>0 k>0 k>0 


k k k k 
= | axcw' = 25] = Yo lagllwt = 2 


k>1 k>1 
1 k 

< =( So klaxlr*)|w = cl. 
e k>1 


Now, fix a real number c such that 1 < c < x Since Ja;] < z fork > 0 


and Wk > 1 (cf. Example 7.23 of [8], for instance), there exists kg € N such that 
Vklax| < % for k > ko. Hence, 


cr\k 
> klax|r* < y (=) = C < +0, 
k>ko k>ko 


cr 


for0 < & < 1. Finally, letting C’ = yy klag|r*, it comes that 


A 


ko 
fw) — FOL Ss —(Doklaele*) tw — 21+ =(D kleulr*) iw = 2 
k=1 


k>ko 
j C 
jw —z| + —|w— zl. 
r r 


IA 


In short, we have shown that 


1 
ws] = GR le) lf) — F@)) <= Ales), 


for some positive constant A. But this being the case, Problem 8 shows that z, > 
z=> f (Zn) > f(z), and the fact that z ¢ D(O; R) was arbitrarily chosen guarantees 
the continuity of f. Oo 
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The coming corollary assures that if a function f : D(O; R) —> C is given by a 
power series, then such a series is uniquely determined. 


Corollary 21.18 Let >°;,. axz* and Veo byz* be convergent power series in the 
open disk D(O; R). If )°x>0 arz* = Dko byz* for every z € D(O; R), then ag = by 
for every k > 0. 
Proof By evaluating equality }°,. az* = eso byz* at z = 0, we obtain 
ao = bo. Then, cancelling out ao = bo in both sides of eso az = 
ee byz*, we get esi az = es byz* for z € D(O; R) and, hence, 
yy apzk-! = eet byzk-! for z € D(0; R) \ {0}. However, since both sides 
of this last equality define continuous functions on D(0; R), we conclude that 
Ye Us! = Vey beck! for every z € D(0; R). Now, looking at such an 
equality at z = 0, we obtain aj = b,. By proceeding this way, an easy induction 
gives ay = by for every k > 0. oO 
As a first application of the ideas developed here, we now present a proof of the 
identities of Proposition 16.17 with the aid of complex generating functions. 


Example 21.19 For z,...,Zn € C, if 5; = sj(z1,...,Zn) denotes the i-th 
elementary symmetric sum of z1,..., Z, and ox = a feet ee then: 


(a) Ontk = aly yon fork > 1. 
(b) seu = Gy (HDI sep joj for 1 =k n=. 


Proof Let 
n n 
FO) =[[d+2X) = Yo sjx!. 
j=l j=0 
On the one hand, we have f’(X) = ae pga’ on the other, for z € D(0; R), 
with 
R=min{|z;|~'; 1 < j <n and z; £0} >0, 
Problem 3, page 393, gives 


n 


f'@ = (doaa +22) )F@ = (Siepid"2/)F@ 
i=l 


i=l j=0 


= (TY cvid 2) 50 


= ( LC diaj412/) ( > siz'). 
j20 1=0 


We now apply the previous corollary in both cases, looking at both expressions 
for f’(z) as complex power series. In (a), computing the coefficient of z*—! in both 
expressions for f’(z) we obtain 
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k-1 
Ksx = a (—1)loj4is: = Soe tT og_i81 
1=0 


jtl=k-1 
so that 
k-1 
(1k ks — DOD! o¢-151 = 0. 
1=0 


Doing the same in (b), we get 


n 
0= 2 Clos = (DE ost 
1=0 


jtl=k-1 


and, hence, 
n 


DCD" ox-151 = 0. 
1=0 
oO 


We finish this section by proving a result that extends, for continuous functions 
f : D(a; R) > C, Theorem 8.26 of [8]. 


Theorem 21.20 (Weierstrass*) If f : D(a; R) => C is a continuous function, 
then there exist Zm and zy in D(a; R) such that 


lf Zm)| = min{| f(z)|; z € D(a; R)} 


and 


| f (zm)| = max{| f(z)|; z € D(a; R)}. 


Proof Let (Zn)n>1 be a sequence in D(a; R) such that 


f Zn) > supt{| f(z)|; z € Dia; R)} 


(here, in principle we do not exclude the possibility that such a supremum could be 
+00). Write 2) = Xn + typ, with xn, yn € R. Since |z,| < R for every n > 1, we 
have |xn|, |¥n| < R for every n > 1. Therefore, Theorem 7.25 of [8] guarantees the 
existence of an infinite set Nj C N such that (%n),en, converges to some x € R. 
Hence, by the same result we can take a second infinite set No Cc Nj such that 
(Yn)neN, converges to some y € R. However, since (x7)neN, iS a subsequence of 
(Xn)neN,, We conclude that (xy )nen, still converges to x. 

Letting zy = x + iy, it follows from Lemma 21.9 that (z,)nen, converges to 
zm. Hence, Problem | gives |zy| < R, so that zy € D(a; R). Now, invoking the 


Karl Weierstrass, German mathematician of the nineteenth century. 
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continuity of f, we have 


fm) = im f Zn) = sup{| f(z)|; z € D(a; R)}. 


néeNo 


In particular, 


sup{| f(z)|; < € D(a; R)} = max{| f(z)|; z< € Di; R)}. 
The proof of the first part of the theorem is analogous and will be left as an 
exercise to the reader. oO 


The result of the following corollary was used in the proof of the Fundamental 
Theorem of Algebra. 


Corollary 21.21 If f : C — C is a complex polynomial function, then, given 
R > 0, there exist zm and zy in D(a; R) such that 


lf Zm)| = min{| f(z)|; z € D(a; R)} 


and 


| f (zm)| = max{| f(z)|; z € D(a; R)}. 


Proof Example 21.16 and Problem 7 guarantee that |f| : C — [0,+00) is a 
continuous function. Thus, so is the function | f| : D(a; R) — [0, +00). It now 
suffices to apply Weierstrass’ theorem. Oo 


Problems: Sect. 21.2 


1. * Let (Z,)n>1 be a sequence in C, converging to z € C. Prove that (Z,)y>1 is 
bounded, i.e., that there exists M > O such that |z,| < M for every n > 1. 
Moreover, if |z,| < R for every n > 1, show that |z| < R. 

For the next problem, we say that a sequence of complex numbers is divergent 
if it is not convergent. 

2. If |z| > 1 and z, = z” for every n > 1, show that the sequence (Zy)n>1 is 
divergent. 

3. * Let (Zn)n>1 and (wy)n>1 be sequences in C, converging to z and w, respec- 
tively. Prove that: 


(a) Ifa € C, thenaz, —> az. 

(b) Zn Wn > ZEw. 

(C) ZnWn > ZW. 

(d) If wy, w 4 O, then zp)/wyn > z/w. 
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4.* If a,b € C and >, ze and )°,., we are convergent series of complex 
numbers, show that )°,. | (azz + bwx) is also convergent, with 


Y\(axk + bwx) = ay ck +b>~ wr. 


k>1 ke] k>1 


5. * Extend the comparison test for series to series of complex numbers. More 
precisely, let (dy)n>0 be a sequence of complex numbers. If there exist positive 
reals c and M such that |a,| < cM” for every n > 0, then the power series 

6. * Let @ #A X C Y Cc Cand f : Y — C be a continuous function. Prove that 
Six : X — Cis also continuous. 

7. * Given @ #4 X C C and a continuous function f : X — C, prove that | f| : 
X — [0, +00) is also continuous. 

8. * Let @ # X C Cand f : X > C be a function satisfying the following 
condition: given z € X, there exist A, B > 0 (at first depending on z), such that 


wex, |w-—z)<B>|f(w)—- f(z)| < Alw—-z|. 


n s 
eno 4nz" converges on the interval 


Show that f is continuous. 
9. * Ifa € C \ {0} and m € N, show that for z € D(0; a we have 


1 n+m—1 
(—azy" > ( m—1 Jaren 
n>0 


21.3. The General Case 


With the material of the previous section at our disposal, we can finally face the 
discussion of the general case of (21.1), i-e., that in which the complex roots of the 
characteristic polynomial (21.2) are not necessarily distinct. To this end, we shall 
use complex generating functions again. The fundamental result is given by the 
coming 


Theorem 21.22 Let (dn)n>1 be a sequence of complex numbers satisfying, for 
every n > 1, the linear recurrence relation 


On+k = Uk-14n4+k-1 + +++ + UOan, 
with ug,...,Ug—1 being given complex numbers and ug 4 0. Let z1, ..., z be 


the pairwise distinct roots of the characteristic polynomial (21.2) of (an)n>1, with 
multiplicities respectively equal to m,, ..., mj. Then, for every n > 1, we have 


Gn = pi(n — ie +-.-+ pin ia 
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where p1,..., pi € C[X] are polynomials of degrees less than or equal to m, — 1, 
..., m — I, respectively, and totally determined by the values of aj, ..., Ak. 
Proof We first claim that there exists a constant Ro > 0 such that |a,| < R” for 
every n > 1 and R > Ro. Indeed, if |a,| < R” for 1 <n < _m, withm > k, then 
lam| = |Uk—14m—1 +e) + U1dm—k+1 + U0am—k| 
S |ux—1[l@m—1] +++ + lua lam—411 + ollam—cl 
< late RO ego a [u|R™—*+1 ae |ug|R™—* 


= R™~ (\uy_1|RE1 +--+ + Jui |R + [wol). 


Hence, if g(X) = X* — |ug_,|X*—! —--- — |uy|X — |uo| and Ro > 0 is such that 
g(R) > Ofor R > Ro, then, for each one of such R’s, the estimates above give 
lam| < R"*(\ug—y[ RE! + +++ + [wil R + lwo!) 
< Rk . RK — R”. 
It thus suffices to choose, from the very beginning, Ro > O such that |aj|,..., |ax| < 


Ro and g(R) > 0 for every R > Ro. 
Now, fix R > Ro and let 


F(Z= ae 


n>1 


Since |a,| < R” for every n > 1, the comparison test for series (cf. Problem 5, 
page 520) guarantees the convergence of F in the open disk D(0; z) of the complex 
plane. Let f(X) = X*—uy_,X*~!—.---—u X —uo be the characteristic polynomial 
of (dn)n>1 and h(X) = —ugX* —u,X*—! —.---—ug_ 1X +1 its reciprocal (ah = k, 
since ug 4 0). For z € D(O; x we have 


k-1 


h@)F@) =—(ouje) (Doane) + Yo an" 


j=0 n>1 n=l 


k-1 
=-— > > UjanZ” ted + ~ itz” 


j=0n=1 n>1 


k-1 


j=0n>k—j+1 n>1 
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For j > 2, write 


k-1 
n n n 
> Ujan-k+jz = » Ujan—k+jZ + You jan—K4j2 
n>k—j+1 n=k—j+l n>k 


and analogously for )°,,.. @nz”. Then, we obtain 


h(F(z) =— > U0An—KZ" — ian, ei 


n>k+1 n>k 
-X( Ta janceasel + Diyenesj2') 

j=2. n=k—j+1 n>k 
+ Yoaet + ane! 

n=1 n>k 

n n 
= = Todt! = Yona agsz 

n>k+1 n>k 

k-1 

j=2n>k n>k 
FT meet + Soon 

j=2n=k—j+1 


However, since 


k-1 
TD ajaecaese = Otte 


j=2nek n>k j=2 


it follows from (21.1) that 


k-1 k-1 
. wot no __ . r n 
— DoD janes je" + Yaar" =D ( = Do ujan—e4j + an) 

j=2n>k n>k n>k j=? 

k-1 k-1 

k 
= ( _ SS ujaj+ ax)z + x ( _ ye Ujan—k+j + an)z" 
I= n>k+1 jae 
k-1 


= ( — > ujaj + ax) + > (UQAn—k + U1Gn—K+1) a's 


jae n>k+1 
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Therefore, 


h(2)F(z) =— D> wodn—ez” — Do wian—a412" 


n>k+1 n>k 
k-1 
k 
+ (- Souja; +ax)z + y (Ugan—k + U1dn—K41) 2" 
j=2 n>k+1 
k-1 k-1 k-1 
n n 
J=2 n=k—j+1 n=1 
k-1 k-1 
= n n 
= (a ak — ujaj)z — x oe Ujan—k+jZ + Yo anz ; 
j=l j=2n=k—j+1 n=1 


so that 


h(z) F(z) = zp(z) 
for z € D(O; x) with p € C[X] being a nonzero polynomial of degree dp < k —1. 
Since h(0) = 1 ¥ O, by increasing R, if needed, we can assume that h(z) 4 0 
for z € D(O; x): On the other hand, since 


F(X) = (K-72)... Kz), 


we have 
A(X) = (1 — 7X)... xy”, 
so that 
zp(z 
aie eee = 2)™ oa 
for z € D(0; 4). 


Now, observe that 
dp <k=0(1—2z2)"!...d — 7z)”). 


Hence, by applying to (21.6) the partial fractions decomposition formula (cf. 

Problem 9, page 457), we conclude the existence, for] < j </and1 <n; < mj, 

of constants dj, ;, uniquely determined by the coefficients of p (and, thus, by a), a2, 
-, Ae and uo, Uy, ..., Uz—1), Such that 
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dj; 
F@)= aS 3 a — i Ze (21.7) 
j=lnj=l =v 
for z € D(O; =). 
However, if r = min{z. a peas mh then the result of Problem 9, page 520, 


assures that 
1 (" +nj- ') ich 
——— ee . Z:Z 
— oer)hj > es j 
(1 — 252)" noo \ "i : 


for] < j </,1 <n; < mj; andz € D(O;r). Hence, it follows from (21.7) that, 
for |z| <r, 


mj 


Fe =? Dan ("7 ae 


j=l nj=l1n=0 


n=l \j=lnj= 


Finally, since a, is the coefficient of z” in the defining series for F’, the last 
equality above, together with Corollary 21.18, gives 


mj 


n+n;—-2 _ 
on = dn (OED er 
j=lnj=1 ie 
1 mj 
2p) Gh, tn ny DOH) -3).. (n+ Inzi- 


j=ln 


I 
= Di pin — D5, 
j=l 
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where 
mj d; 
; _ JET a — n—1 
p(X) = >> a_i yi try IX +nj — 2)... (K+ 024 
nj=l : 
= Cjmj 1X"! $+ + ej X +cj0 
is a polynomial of degree at most m; — 1. oO 


In order to present a relevant application of the previous result, we first need a 
definition. Throughout the rest of this section, AP stands for arithmetic progression. 


Definition 21.23 Given an integer m > 1, we call a sequence (a,),)>1 an: 


(a) AP of order 1 if it is an ordinary AP. 
(b) AP of order m if the sequence (by)n>1, given forn > 1 by by = an+1 — Gn, is 
an AP of order m — 1. 


The coming lemma characterizes AP’s of order m by means of a linear recurrence 
relation that generalizes those satisfied by AP’s of orders 1 and 2 (cf. Sections 3.1 
and 3.3 of [8]). 


Lemma 21.24 A sequence (an)n>, is an AP of order m if and only if 


m+1 m+1 m+1 
Antm+1 — fig eel Qn=0, (21.8) 
0 1 m+1 


for everyn > 1. 


Proof Firstly, let (a,),>1 be an AP of order m. If m = 1, then (a@,)n>1 is an ordinary 
AP, and (21.8) reduces to ay42 — 2an41 + a, = 0 forn > 1, the linear recurrence 
relation that characterizes ordinary AP’s. 

By induction hypothesis, assume that (21.8) is valid form = k — 1. Form =k, 
the sequence (b,)n>1 given by by = dn+41 — dy is, by definition, an AP of order 
k — 1. Therefore, the induction hypothesis gives 


at an i*(*\s, =0 21.9 
(5) ne- (1) mci t+ DE Yon = ( 9) 


for every n > 1, or 


k k 
(6) (An+k+1 = an+k) = ({) (ems = Gn+k—1) a 


k 
+ (—1)4 (1) (Gn41 — Gn) = 0 


(21.10) 
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for every n > 1; in turn, this is the same as 


(Jos -(@) (see (CQ) amr» 
sscat((E Gasca ens 


By applying Stifel’s relation, together with the facts that (5) = es and (') = 
(71). we get (21.8) for m =k. 

Conversely, let (a,)n>1 be a sequence for which (21.8) holds. If m = 1, then 
An42 — 2dn41 +a, = 0 for n > 1, and we already know that (a,),>1 is an AP. By 
induction hypothesis, suppose that the validity of (21.8) for m = k — | implies that 
(Gn )n>1 is an AP of order k — 1. Then, consider a sequence (a,)n>1 satisfying (21.8) 
for m = k, i.e., such that 


k+1 k+1 k+1 
( 0 Jansers = ( 1 Janse to FED EET lag =0 


(21.11) 


for n > 1. Using Stifel’s relation to write ee = (‘) + («)) for 1 < 
j. <_ &, together with ‘ey = (6) and () = (i). we successively reob- 


tain (21.11), (21.10) and (21.9), for every n > 1. Hence, the induction hypothesis 
assures that (b,)n>1 is an AP of order k — 1, so that, by definition, (a,),>1 is an AP 
of order k. oO 


We are finally in position to present the promised application of Theorem 21.22. 


Example 21.25 Tf (an)n>1 is an AP of order m, then (21.8) holds for every n > 1, 
so that the characteristic polynomial of (a,)n>1 is 


_ m+1 m+1 m+1 m me _yym+l m+1 
70) = ( 0 )x ( 1 )x +--+ (-1) (ee 


— (X _ tert, 
Theorem 21.22 thus guarantees the existence of constants a, a1, ..., @m Such that 
An = Ay + y(n — 1) +--+ + ay, (n — 1)” 


for every n > 1. Evaluating the above relation for n = 1, we get a9 = aj; forn = 2, 
...,m+ 1, we obtain a1, ..., @m as the solution to the linear system of equations 


Oy + 2 +++ + Am = a2 —a\ 
201 + 27a +++ +2” Om = 43 — a| 


may + may +++» +m™Otm = Am41 — a1 


21.3. The General Case 


Notice that the fact that such a linear system always has a single solution is an 


immediate consequence’ of the last part of the statement of Theorem 21.22. 


Problems: Sect. 21.3 


1. Let k be a given natural number and (a,),>1 be a sequence of complex numbers 


such that 


1 
an = 3 Gn-k + An+k)s 


for every natural number n > k. Show that 


k 
j= (Aj +(n— 1)Bj)oi *—) 
j=l 


for every n > 1, where w = cis and Aj, ..., Ag, By, ..., Be are certain 


complex constants. 
2. Find all real sequences (a,),>1 such that aj = 1, 


Gn43 = 5an42 — 8dn41 + 4p 


n 
for every n > 1 and 5; —> 3. 


‘This also follows from basic Linear Algebra, for the transpose of the matrix of the coefficients of 


the linear system is a Vandermonde matrix of nonvanishing determinant. 


Chapter 22 ®) 
Hints and Solutions hook for 


Section 1.1 


1. If there exists a bijection f : 1; — J,, show that n = 1. Now, assume that 
m,n > | and there exists a bijection f : I, — I,. Letting k = f(m), show 
that there exists a bijection g : I, \{k} — In-1, so that go fiz,,_, : Im—1 > In-1 
is also a bijection. Then, use an inductive argument to deduce that m—1 = n—1. 

2. Ifg: Im > Aandh: I, — B are bijections, define f : nin > AU B by 


letting 


k), ifl<k<m 
fay=feO . 
h(k-—m),ifm+1<k<m+n 
Since g and / are surjective andm+1 < k < m+nifandonlyif 1 <k-—m <n, 
we conclude that f is also surjective. For the injectivity of f, suppose that 
Stk) = f@O, with 1 < k,] <m-+-n. There are four cases to consider: 


© 1<k,l < m: then f(k) = fd) => gtk) = gi), and the injectivity of g 
gives k = 1. 

© m+1<k,l<m-+n: then f(k) = fQ) => h(k-—m) = hil —m), and the 
injectivity of h gives k —-m =1—~m, thusk = 1. 

© 1<k<mandm+1 </ < m+n: then f(k) = f() > gk) =h(—-m) e€ 
AM B, acontradiction to the fact that A and B are disjoint. Therefore, in this 
case it is always the case that f(k) ~ f(l). 

¢ m+1<k<m-+nand1 </ </mz:as in the previous case, we always have 


SAYA LO: 
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. By induction, if k <n and | Ui Aj| = Ys |A;|, then 


Caer \= =| (ui! -14j) U Agtil = Ut 4i| + |Ax+1| 


k+1 


k 
=) Ajlt+ lA] = > 1Ajl- 
j=l 


j=l 


. Let’s make induction on |A| = |B|, the case |A| = |B| = 1 being trivial. 


Suppose, by induction hypothesis, that the result is true when |A| = |B| = n, 
for some n € N, and let A and B be such that |A| = |B] =n +1.Givenx eA 
and a function f : A > B, let y = f(x). Then |A \ {x}| = |B \ {y}| = 7 and 
f restricts to a function g : A \ {x} — B \ {y}. By the inductive hypothesis 
g is injective if and only if it’s surjective. However, since f is injective (resp. 
surjective) if and only if g is injective (resp. surjective), it follows that f is 
injective if and only if it’s surjective. 


. If (a,b) = (c,d), then {{a}, {a,b}} = {{c}, {c,d}}. If a = b, then 


{{a}, {a, b}} = {fa}, fa, a}} = {ta}, ta}} = {{a}}, so that {{c}, {c, d}} must 
have just one element. Hence, c = d and {{a}} = {{c}}, so that a = c and then 
a=b=c=d. Thecasea ¥ b can be treated analogously, and will be left to 
the reader. 


. For the first part, if B = B,} U...UB,,a € A andb € B, show that the ordered 


pair (a, b) belongs to the left hand side if and only if it belongs to the right hand 
side. 


. By induction, if k <n and |A, x --- x Ag| = Ties |A;|, then 


|Ay x +++ x Ag x Agqil = |(A1 x +++ x Ag) X Aq 
= |A, x--- x Agl: [Ag+] 
k k+1 


=e |Acal = ne I. 


Now, observe that Proposition 1.6 deals with the initial case. 


. If A = {aj,..., a}, then, for a fixed integer 1 < i < n, there are exactly 


k possible choices for an index 1 < j < k such that a; €¢ Aj. Now, use the 
fundamental principle of counting. 


. After moves, the paws will be at a point (a, b) such that a = x1 +x2+--+-+Xn 


and b = yj + yo +--+ yn, where x;, yj € {—1, 1} for | < i <n. Hence, we 
have (i) —n < a,b < n and (ii) a and b have the same parity of n (this is, either 
a and b are odd, if n is odd, or even, if is even). Now, first certify yourself 
that any point (a, b) satisfying conditions (i) and (ii) can be a final destination 
of the pawn; then, use the fundamental principle of counting to show that there 
are exactly (n + 1)? of such points. 
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10. For item (a), if (aj,..., ax) is such a sequence, then aj,...,axr € {j,j + 
1,...,n}, so that there are exactly n— j +1 possibilities for each a;; then, apply 


11. 


the fundamental principle of counting. For item (b), observe that the set of the 
wished sequences is the difference between the set of the sequences considered 
in item (a) and the set of the sequences of k elements of J, (possibly with 
repetitions) such that the smallest term of the sequence is greater than or equal 
to j + 1. Finally, by item (b) the sum of item (c) is equal to)", j((m — j + 
pe-™— ps: verify that such a sum equals 1 OF 4 

For k > 3, let J be an odd integer such that 1 < J < 2k-! _ 1. Ror each of 
these I’s, let Ay = {2'I € A}, so that 


k-1_] 
A= [J 4,u {21 41,24143,...,2*- 1}, 
I=1 


with the index J varying over the set of odd integers from 1 to 2‘—! — 1. From 
this, conclude that if we wish a subset X C A with the stated property to have 
the greatest possible number of elements, then: (i) (ge! +1, 2*-! +3,..., Qk 
1} C X; (ii) for each odd integer 1 < J < 2‘~! — 1, at most half or [half| + 1 
of the elements of A; can be in X. Then, refine the analysis in (ii), showing that 
if a € N and J > 3 is an odd integer satisfying 27 < J < 2°*!, then at most 
==) + 1 elements of A; belong to X. From this, conclude that X has at 


2 
k—2 
k ko@=1 
ame) ed be | y eae 


most 
elements. Finally, separately consider the cases k even and k odd to show that 
Ss = gel (1k 

= 


Section 1.2 


1. 


2. 


Exhibit a bijection from the family of the subsets of J, containing n to the family 
of subsets of J,—1. 

For each x € A, show that there are 2” — 1 ways of choosing the subsets A; 
which contain x. 


. The n straight lines pairwise intersect at (5) = madd distinct points. Choose a 


line r which is not parallel to any straight line passing through two of these () 


points. There are two kinds of regions: those which possess a highest point with 
respect to r—which is necessarily one of the (5) points of intersection of two 
of the n given lines—and those which do not possess a highest point. Show that 


there is a bijection between the regions of the first kind and the set formed by the 
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3) points; then, show that there are exactly n + | regions of the second kind, so 


that the total number of regions is equal to ned +(n+ 1). 


. For item (a), consider the family of subsets of 7, containing n. For item (b), let F 


be an intersecting system in J, and denote by F;, the family F, = {A°; A € F}. 
Note that |F.| = |F|, 7 UF. = PU) and show that FN F, = @. 


. For item (b), for 1 < i < n let a; be the number of sets in Z,, which have i as 


central element. Item (a) gives a; = ay41~;. Hence, 


2) (A) = Doig: + YI @ 41 angi 


AETLy i=l i=l 


n n 
= Y iai + Yiint 1 — i)a; 
i=l i=l 


= iat Da = a+ dai 
i=l i=1 


=(n+DIT,|=@+)-2771. 


. For item (a), start by showing that, for and integer 1 < k < n — 1, there is 


a bijection between the partitions of m having k as one of its summands and 
the partitions of n — k. From this, conclude that the contribution of k to q(n) 
equals p(n — k). For item (b), show that if k is the greatest possible diversity 
of a partition, thenn > 1+2+4+---+k, so thatk < J2n. Then, notice that 
q(n) < kp(n). 


Section 1.3 


Nn 


. Show that there are exactly an+3 ways of writing n+4 as a sum of summands 1, 


3 or 4, with at least one of these summands being equal to 1; make an analogous 
reasoning to deal with a,+; and ay. 


. For an integer n > 3, let a, be the number of diagonals of a convex n-gon, so 


that az = 0. Let Ay Az... Ax Ag+) be a convex (k+ 1)-gon, so that Aj Az... Ag 
is a convex k-gon. Counting the number of diagonals of A,A2...AgAg+1 
departing from Ag+ 1, show that ag4) =azr +k — 1. 


. For item (a), consider a tiling of a 2 x (k + 2) checkerboard, and look at the 


possible ways of filling its upper right 1 x | square with a domino. For item (b), 
compute a; = |, az = 2 and, then, apply the results of Chapter 3 of [8]. 


. Adapt, to the present case, the solution of Example 1.17. 
. For item (a), fix an element x of A and let A’ = A\{x}. Start by showing that, if a 


subset B of A has an even number of elements, then there are two possibilities: 
(i) x ¢ B, so that B is a subset of A’ with an even number of elements; (ii) 


22 


10. 


11. 
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x € B, so that B \ {x} is a subset of A’ with an odd number of elements. Then, 
show that the correspondence B +> B \ {x} establishes a bijection between the 
family of subsets of A which contain x and have an even (resp. odd) number of 
elements and the family of subsets of A’ with and odd (resp. even) number of 
elements. For item (b), recall that a, + by, = 2”. 


. More generally, let f(n, k) denote the number of ways of choosing k noncon- 


secutive chairs out of a row of n chairs. Show that f(n,k) = f(n—2,k-—1)+ 
f(a — 1,k), then use such a recurrence to conclude that there are exactly 56 
ways of choosing the three chairs. 


. For (a), assume we have k + | disks piled at the leftmost rod. We obviously 


need at least a, movements to pile the k upper disks at the central rod; an extra 
movement puts the larger disk at the rightmost rod; finally, at least more ax 
movements move the k disks from the central to the leftmost rod. This amounts 
to a total of at least 2az + 1 movements, so that ay; = 2a, + 1. For (b), either 
make induction on n or note that ag4; + 1 = 2(ax + 1), taking into account that 
aj=l. 


. For item (b), note that there are two kinds of partitions of 7,41 into k + 1 


nonempty, pairwise disjoint subsets: those in which {n + 1} is one of the sets 
of the partition—there are exactly S(n, k) such partitions—and those in which 
n + 1 belongs to a set of the partition having at least two elements—there are 
exactly (k+1)S(n, k+ 1) such partitions. Note that item (c) asks us to compute 
S(7, 3), for which we can use the recurrence relation of item (b), together with 
the results of (a). 


. Let a, denote the number of desired sequences, and b, the number of sequences 


of n terms, all of which equal to 0, 1, 2 or 3 and having an odd number of 0s. 
Argue recursively to show that ay = 3aj—1 + by—1 and by = dy—1 + 3by 1 for 
all n > 2. Then conclude that agi; — 6ag + 8ax—1 = 0 for every k > 2 and 
compute aj = 2, a2 = 1 +3? = 10. Finally, apply the result of Problem 5, 
page 79. 

For item (a), use the fundamental principle of counting to show that there are 
exactly 3 - 2”! flags with n strips, for every integer n > 1. For item (b), start 
by observing that aj = 0 and a2 = 6; then, show that, for n > 1, there is a 
bijection between the set of flags with n + | strips in which the first and last 
strips have equal colors and the set of flags with n strips in which the first and 
last strips have distinct colors; finally, use the result of item (a). For item (c), 
use the result of (b) to show that dy42 = Gn41 + 2a, for every n > 1; then, 
solve this recurrence relation. 

For item (b), let A be a minimal selfish subset of J, containing n. Note that 
1 ¢ A (for, otherwise, A wouldn’t be minimal) and A + {n}, J,. We claim that 
the set A’ = (A \{n}) —1 is a minimal selfish subset of [,,-2. To check this, note 
first that A’ C I,—-2; on the other hand, |A’| = |A| — 1 and, since |A| € A, we 
have that |A] — 1 € (A— 1) C A’. Thus, A is a selfish subset of J,,-2. In order 
to show that A’ is minimal, assume that there would exist B’ Cc A’ such that 
B' # A’ and B’ was selfish. Then, prove that B = (B’ + 1) U {n} Cc A would 
be a selfish subset of J, containing n. Use the minimality of A to conclude that 


534 


12. 


13. 


14. 
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B = A and, hence, that B’ = A’, which is an absurd. Analogously, given a 
minimal selfish subset B of J,—2, prove that (B + 1) U {n} is a minimal selfish 
subset of J,,. For item (c), if a, denotes the number of minimal selfish subsets 
of I, it follows from items (a) and (b) that a, = ay,—1 + dy,—2 for every integer 
n > 2. Now, verify that aj = a2 = | and conclude that a, = F, for every 
n> 1. 

For item (a), note that there are three distinct types of nonempty subsets of Y: 
the nonempty subsets of X; the sets AU{m}, where A C X is nonempty; the set 
{m}. Then, write ay as a sum with three summands, each of them corresponding 
to the subsets of one of these three types; while doing this, note that if A C X 
is nonempty, then 7(A U {m}) = m(A)m. Item (c) follows from (b), together 
with a straightforward induction. 

For item (a), adapt to the present case the hint given to the item (a) of the 
previous problem. To this end, note that if @ 4 A C X, then o(A U {m}) = 
o(A)+m and 1(AU{m}) = 2(A)m. Item (c) follows from item (b) by an easy 
induction. 

Let Jo = {1} and, for every natural k > 1, let Jy = {2k-1 +1,..., ant. Show 
first that, for every k, there exists a good set By C {1,2,3,..., ay such that 
|Br| = be and Je C Bg, i.e., Be A Je = Jp. Since By is good and By N Je = Jk, 
we must have By M Jp_1 = YO. Now, let Xz_2 = Be {1,2,3,..., 24-7}. 
Clearly, every subset of a good set is also a good set; therefore, X;_2 is good. 
Conclude from this fact that by < bg_2+2*—!. Then, taking a good subset Y,_2 
of {1, 2, 3,..., 24-7} and looking at Yp_2 U Jy, show that by > by_2 + 2&1. 
For item (b), use the recurrence relation of item (a), together with telescoping 
sums (cf. Chapter 3 of [8]), to get by = ae el 


Section 1.4 


1S) 


. For item (a), expand the products in (x + y)” = (x + y)...(a + y) (w factors 


equal to x+y) and note that the monomial x”~* y* appears as many times as the 
number of ways of choosing a subset of k elements from the set {1, 2,..., m}. 


Then, generalize such an argument to get the result of item (b). 


. Letxy =--- = xy = Lin (1.12). 
. Notice that each (nonordered) partition of a set A of n elements into k nonempty 


subsets gives rise to k! ordered partitions of A into k nonempty subsets. 


. If a; is the only local maximum of a permutation (a1, a2, ..., 4) of In, then 


aj =n. Now, show that aj < --- < aj—; and aj4, > --- > dy, So that it 
suffices to choose the set {a), ..., aj}. Finally, observe that 7 may vary from 
l ton. 


. Show that each 6 of the n given points determine a single triangle satisfying the 


given conditions. 
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10. 


11. 


. Firstly, show that the total number of subsets with an even number of elements 


equals )°,59 (5;,)- Then, use item (b) of Corollary 4.25 of [8]. 


. Ifexactly 2k terms of the sequence are 0’s, show that there are exactly (ap) 6 ied 


sequences, so that the total number of them is }7,.9 (i) 3"—2k Now, note that 


- Cee _ 3 (Ge as (3. aa) 


k>0 k>0 

1 n n 
= gn 2k a) 

se (64 aaa 

k>0 
1 n n I n n 
= 3(3+) +(3-1)") = G +2"). 
. Consider a sequence (a1, ...,@n4k—1) of n+ k — | terms, with k — 1 of them 


equal to 0 and the other n equal to 1, and let 1 <i) <...<ip-) <n—-—k+1 
be the indices of the former ones. There are xj = i; — 1 > O terms that come 
before a;, (which is the i-th one), x2 = iz — i; — 1 > O terms between qj, 
and dj,,..., Xk = (n —k + 1) — ig_, = O terms that come after a;,_,. Since 
all of these are equal to 1, and there are exactly n terms equal to 1, we get 
xy +---+ x, =n. Thus, starting from a sequence (a1, ..., Gn4k—1) aS above, 
we get a solution, in nonnegative integers, of the given equation. Conversely, 
given a solution x; = jj,..., xx = jx in nonnegative integers of the equation, 
we can form a sequence (a1, ..., Gn+k—1) aS above, letting the terms of indices 
AFL, fat jot2, fit jot: +++ je-1+ (k— 1) be equal to 0 and the remaining 
ones be equal to 1. Therefore, by the bijective principle we only need to count 
the total number of such sequences, which is rather simple: it suffices to choose 
a subset of k — | elements of the set of J,+,—1 and, then, to impose that all the 
terms corresponding to the chosen indices are equal to 0, while the remaining 
ones are equal to 1. Proposition 1.26 gives ‘(gag exactly different choices. 
If yj = x; —1forl <i <k, then y; > Oand y+---+y, =n—k. Now, 
apply the result of the previous problem. 

For item (a), if A = {aj,..., ax} C I, is a set without consecutive elements, 
let x) = ay — 1, X41 =n — ay and x; = aj; — aj-; —2 for2 < j < k. Then, 
x; >Oforl <j <k+1,and 


k+1 k 
oxy = -— D+ DG; - aj-1 — 2) + 2 — a) 
j=l j=2 


= (aj — 1) + (@, — ay — 2(K—1)) +n- ay 
=n—22k+1. 
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For item (b), let aj = x) + 1 and aj = S23 +2 -—1)for2<j < 
Then, a; > 1,aj —aj-; = 2forl < j <k and 


k k 
n-Q=n a 2(k 1) = (n-2k+1)- ox tl amit 


i=1 i=l 


IV 


hence, a solution of equation ae x; = n—2k-+1 in nonnegative integers gives 
rise to a k-element subset of J, satisfying the conditions of Kaplansky’s first 
lemma. For item (c), the bijective principle, together with the results of items 
(a) and (b), guarantees that it suffices to count how many are the nonnegative 
integer solutions of equation x; +---+ x44, =n — 2k + 1. This follows from 
Problem 9, which shows that there are exactly (e sa Ms : = La y such 
solutions. 

Letting A C I, be such a set, there are two possibilities: (i) 1 € A or (ii) 
1 ¢ A. Incase (i), A \ {1} has k — 1 elements, no consecutive elements and must 
be contained in {3, 4,...,2 — 1}; apply Kaplansky’s first lemma to conclude 
that there are a) such sets. In (i), A can be any k-element subset of 
{2,3,...,} without consecutive elements; apply Kaplansky’s first lemma to 
conclude that there are (a) such sets. Finally, add possibilities. 

If A = {a1,..., ax} and f(a;) = x;, then (1.13) gives xj +---+ x, =n. Now, 
apply the result of Problem 10. 

Let A = {1,3,5,...,2n—1} and B = {2,4,6,..., 2n}, so that 4, = AUB,a 
disjoint union. Let F be the family of subsets of J, with equal numbers of even 
and odd elements, and let P be the family of subsets of /2, with n elements. 
Show that X bh (XM A) U (B \ X) establishes a bijection between F and P. 
Then, conclude that |F| = |P| = (7). 

For A C In, Problem 2, page 13, shows that there are (2” — 1/4! sequences 
(Aj,..., An) of subsets of J,, such that AyU...UA, = A. Now, forO < k < m, 
we know that J, has exactly (7) subsets A having k elements. Hence, letting S 
denote the desired sum, we get 


>. > |A,U...U Ay 


ACIm A1U...UAnp=A 


= » (2" — 1/4) Al = > (Te = kk 


ACIn k=0 


m(2" — 1) )- a 1)" — pk 
k=1 


= m(2" — 11+ 2" — yj"! 


S 


— m(2” _ por. 
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16. Let (i, 7) be an ordered pair of distinct integers chosen from 1, 2,...,n. Note 
that there are (n — 2)! permutations (a1, a2, ..., dy) of {1, 2,...,m} in which 
a, = i and a2 = j, and these contribute with (n — 2)!(i — ips for the sum 
ae So. However, the same pair (i, j) gives this same contribution when ay = 
i and ai; = j, for some k from 1 to n — 1. Therefore, (i — j)? appears 
(n — 1)(n — 2)! = (n — 1)! times in the sum )>,, S,. It thus follows that 


Yo Se = Do —IG - p, 
o Gj) 
and the average value of S, equals 
1 


— So = - Di - DIG - 7? = . dee - A’. 


g “GA Gj) 


A= 


In order to compute the last sum above, note that 


3c iy =-y-7" = Se? + 7-23) 


Gj) i=l j=1 i=1 j=l 


n n n n n n 
=)y yey yey yy 


i=1 j=l i=l j=l i=l j=1 


n 


=- ndoP end P-2(Sos] Sj 
i=l j=l j=l 


i=1 


= 


1 n n 2 
=- am Sr? -2(Soi) 
= 


i=l 


7 ant +)Qn+) _, é fe ») | 
n 6 2; 


= Goes). ("F') 
ns Sins Oe: lay © 


—_ 


17. Fix one of the 2” + 1 given points, draw the diameter passing through it 
and suppose that such a diameter leaves d of the remaining 2n points in one 
semicircle, and the other 2n — d points in the other one. Use combinations to 
conclude that the number of obtuse triangles having the fixed point as one of 
their vertices and an acute angle at this vertex equals d* — 2nd + 2n? —n 
(therefore, it is always greater than or equal to n(n — 1)). Adding over the 
2n + 1 given points, conclude that there are at least (2n + iG) = aerDe dD 
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obtuse triangles. a deduce that the total number of acute triangles is at 
most n(n+1)(2n+1) 
6 


Fixed 1 <i<n and 0 < j < min{i — 1,n — i}, Proposition 1.26 gives exactly 
(' 7) ways of choosing j elements of J, less than i and ‘- ) ways of choosing 
j elements of J, greater than i. Hence, the additive and multiplicative principles 
guarantee that 7 is the central element of 


ae lew 


sets A € Z,, with the above sum extending through all indices 0 < j < min{i— 
1,n — i}. For the second equality of item (a), there are at least two possible 
reasonings: 


(i) Computing the coefficient of x’! in both sides of the equality 


we get, with the aid of the binomial formula, 


(5) G2) 


jz0 
Gi) If xp44 = i and A = {x}, x2,...,x2%41} iS as above, write A = BU 
{x41} U B, with B = {x1,..., xe} and B = {xe42,..., xxi} if B= 


{fi+1,i+2,...,n—I1,n}\ B, show that the correspondence 
BU by }UBR> BUB 


establishes a bijection between Z, and the family of subsets of n — i 
elements of J, \ {i}. Since |Z, \ {i}| = n — 1, use Proposition 1.26 to 
conclude that 7 is the central element of — a sets in Z,,. 


The rest of the proof follows immediately from the lines’ theorem of Pascal’s 
triangle (cf. item (a) do Corollary 4.25 of [8]): 


n 


Yea =O (Fy)! 


AcIy i=l 


=o (7) +E) 
i=1 i=1 


— (n —_ 1) 7 gn-2 + gn-l = (n + 1) . gn-2. 
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19. 


20. 


21. 


22. 


For item (a), if f does satisfy the given conditions, then it is clearly injective, 
for f(x) = f2) > f(f@)) = f(f@2)) > x1 = x2. Now, it follows 
from the result of Problem 4, page 7, that f is a bijection. Assume that f has k 
fixed points. Fix a € I, such that a is not one of them and let b = f(a). Then, 
b #£ a and f(b) = a. Thus, we can group the remaining n — k elements of J, 
(all but the fixed points of f) in pairs (a,b), with b = f(a) anda = f(b), 
which implies that n — k must be even. For item (b), let 0 < k < n be such 


that n — k is even. For each choice of k elements x1,..., x, € J, and for each 
partition {{a,, bi}, ..., {a;, bj}} of the remaining n — k elements of J, there 
obviously exists a single function f : J, — I, having x1,..., xx as its fixed 


points and satisfying f(a;) = bj and f(b;) = a; for | <i < /. Observe that 
f(f(@)) = x for each x € J,. For item (c), once fixed 0 < k < n such that 
n — k is even, there are G) ways of choosing k elements of J, to be the fixed 
points of f. In turn, the number of ways of partitioning the remaining n — k 
elements of J, into disjoint 2-element sets is equal to 


n—-k—2 2 (n —k)! 

(2!) G)= ame 
Hence, the multiplicative and additive principles assure that the number of 
functions we wish to count is equal to the given number. 
Let |F| = k. To each A = {a, b, c} € F, associate its 2-element subsets {a, b}, 
{b, c} and {c, a}. The condition given in the statement of the problem assures 
that each 2-element subset of J,, is associated to at most one of the sets A € F. 
Thus, 3k < (a) which gives us the desired inequality. 
For each | < i < n, choose B; C A; such that |B;| = k. The assumption 
|A; 0 A;| < k—1 for 1 <i < j <n guarantees that the correspondence 
i +> B; 1s injective; indeed, if 1 < i < j <n are such that B} = B; = B, 
then |A; N Aj| = |B| = k. Now, {B1,..., Bn} is a family of pairwise distinct 
m-element subsets of A, so that Proposition 1.26 gives n < (h. 
For 1 < k <n, we let mz denote the number of P;’s which contain az; conclude 
that m, + --- +m, = 2n. If F is the family of sets {P;, Pj} such thati # j 
and P; 1 P; # W, use the given conditions to show that |F| = )°7_, ("3'). 
Now, consider f : F — {P1,..., Px} such that f = ({P;, Pj}) = Px, with 
Px = {a;, aj}; conclude that f is injective, so that 


(0) 


i=1 


and, then m+ +++ m> < An. Finally, use the inequality between the quadratic 
and arithmetic means (cf. problem 5.2.3 of [8], for instance) to conclude that 
Nias, 
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1. In each item through (a) to (d), evaluate the corresponding functions at x € A. 

2. How many natural numbers from 1 to 1000 have a prime factor less than 10? 

3. Use the result of Example 2.6, together with the fact that, since m < n, there is 
no surjective function f : In, > In. 

4. Let C),...,Cn be the n couples, A be the set of distinct lines of 2n 
people formed with the members of the given couples (and without further 
restrictions), and A; be the subset of A formed by those lines in which both 
members of C; occupy neighboring positions. We wish to count the number of 
elements of A \ (Aj U...U Az). To this end, apply the inclusion-exclusion 
principle, noting that |A| = (2n)! and |Aj, 1... Ai,| = (2n — k)\(2k for 
1l<ij<--++<ip<n. 

5. For 1 < j <5, let P; be the set of permutations (a1, a2,..., ag) of J¢ such that 
(a\,..., aj) is a permutation of J;. We wish to count |P \ Lea P;|, where P 
denotes the set of all permutations of J¢. To this end, if 1 <i; <--- < ip <5, 
conclude that 


[Pip 1... Pi,| = t1'G2 — i)! . ie — tx-1) (6 — ix)! 


Finally, apply the inclusion-exclusion principle and compute the value of each 
possible summand above. 

6. Let A= {1 <m <n; gcd(m,n) = 1} and, for! <i <k,letA; ={l<m< 
n; p; | m}. Then,/, \ A = Ly A; and, in order to compute |A\, it suffices 


to apply inclusion-exclusion. To this end, note that if 1 <i) <---<ij <k 
are integers, then m € Aj, 1... Aj, if and only if pj, ... pj, | m, so that 
|Ai,N...N Aj.| = —"_. 

J Pi Pi; 


7. Start by showing that, for q € N, the number of naturals less than or equal to 
n and divisible by q equals ak Then, let A; = {a € N; a < nand p; | 
a}, and notice that the number of primes less than or equal to n is given by 
n—|A,U...U Ag]. Finally, apply inclusion-exclusion. 

8. Let A denote the set of distinct partitions of n in natural summands and, for 
1 < j <n, let A; denote the subset of A formed by those partitions with at least 
one summand equal to 7. Then, p(m) = |A| and, for 1 <i) <... < ig <n, it 
is easily seen that |A;,...M Ai,| = p(n — 1), where 1] = ij +--+ + ix. Now, 
note that A = A; U...U A, and use inclusion-exclusion to get 


pPa=ICDE' YS) DI pd. 
k=1 


1=1 1<i| <...<ig<n 


ipte tip =l 


Finally, on the one hand, the last summand has exactly p;(/) summands; on the 
other, 1 <i) <...<ip <nandi; +---+ ix =/ imply 
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10. 


k(k+1)_ ke? 
n>I1l> ti taicdce = = 
2 2 
and, hence, k < /2n and! > e. 
. Let A be the set of solutions (x;,...,x,) of the given equation, with x, € Z+ 


for 1 < t <n. For 1 <i < an, let A; be the subset of A formed by those 
solutions (x1, ..., X,) in which x; > k. We wish to count |A \ (A, U...UAn)]; 
to this end, apply inclusion-exclusion, recalling (cf. Problem 9, page 29) that 
|A| = gun and showing that 


n—-1 


m—j(k+1)+n-1 
Jan. Ay = ( 7 ) 


n—1 
for 1 < ij < --- < ij << n. For this last computation, observe that 
(X1,--.,%n) € Ai, NN... MN Ai; if and only if Xj,5--++%i; > k + 1; hence, 


letting y, = x; fort Ai;,...,i; and y;, = x, —(k+ 1) for1 <1 < j, we have 
Y1,---,¥n = Oand yy +---+ yn, =m — j(k + 1); now, apply once more the 
result of Problem 9, page 29. 

For the first part of (a), note that there are 2 ways of choosing n pairwise 
nonadjacent chairs, and after that there are n! ways of distributing the men 
in them. For (c), note that if Ao;-1 Az; was not empty, then w; should 
simultaneously sit in chairs i — 1 and i; also, if Ao; M A2j+1 was not empty, 
then w; and w;+1 should both sit in chair 7. Item (d) follows from the fact that, 
for every intersection of the form A;,A;,...Aj,, exactly k of the n women 
have their positions entirely determined, whereas the n — k remaining ones can 
be permuted in any desired way. Finally, for (f) apply the inclusion-exclusion 
principle. 


Section 2.2 


BRWN FR 


. Explicitly write the summands at both sides. 

. Note that both sides count the number of subsets of a set having n elements. 

. Adapt, to the present case, the idea used in the proof of Example 2.9. 

. Start by choosing k odd elements and k even elements; then, use Lagrange’s 


identity. 
. Let Pi, Po,..., P11 be the patrols and hy, h2, ..., hy be the volunteers. Define 
a function 
f Sigsie inl = Ue eee Stay el 
by letting 


FAs) = (Pj, Pa) > fy enters P; and P;. 
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The conditions given immediately assure that f is a bijection, so that there are 
as many volunteers as 2-element subsets of the set of the 11 patrols. Thus, there 
aren = ()) = 55 volunteers. Now, let A be the set of ordered pairs (h;, P;) 
such that h; € P;. Since each of the 55 men enters exactly two patrols, A has 
exactly 2-55 = 110 pairs. On the other hand, if k is the number of members in 
each patrol, then P; is the second entry of exactly k ordered pairs in A (those 
whose first entries are the k members of P;); however, since there are precisely 
11 patrols, it follows that A has 11k ordered pairs. Therefore, 11k = 110 and, 
hence, k = 10. 
Show that both sides of the desired equality count the total number of fixed 
points of all permutations of [,. 
Let a), d2,..., Gm be the initial terms of the arithmetic progressions, and n > 
a,,...,@m a natural number. Let’s count the number of elements of J, in two 
different ways, the obvious one being |/,,| = n. For the other one, assume that 
the arithmetic progression of initial term a; and common ratio d; has exactly k; 
terms in J,,, so that ae d; = nand a; + (kj —1)d; <n < a; + kjd; or, which 
is the same, 

n— aj n— aj 
<kj < 7 +1. 


i i 


Adding all such inequalities for | <i < m, we conclude that 


m 


m 
n— dj n-aj 
dj é d; 
i=l i=1 


or, which is the same, 


Finally, let n > +00 to get >", + = 1. 


. Associate 0 or 1 to each line segment which is a side of a smaller triangle, 


according to whether its vertices have equal or distinct labels, respectively; 
then, associate to each smaller triangle the sum of the numbers associated to 
its three sides. Now, observe that the number associated to any smaller triangle 
is equal to 0, 2 or 3, being equal to 3 if and only if the corresponding triangle 
is labelled as ABC. Finally, use double counting to show that the sum of the 
numbers associated to the smaller triangles is always odd. 

Let x denote the desired number of commissions and y the total number of the 
remaining ones. Firstly, note that x + y = 4060. Now, fix an arbitrary senator 
and show that he/she enters exactly 268 commissions in which the other two 
members are both his/her friends or enemies. Finally, use double counting to 
conclude that 3x + y = 30- 268. 

Let A, A2... An be the n-gon, Ay A2M A, A3 = {X} and A2A4N A3As5 = {Y}; 
choose the following n — 2 points: a point B in the interior of Aj A2X, a point 
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C in the interior of A3A4Y and, for 5 < j < n, a point in the interior of 
A2A3A;NAj-1Aj;Aj;+1. With the aid of the fundamental principle of counting, 
show that the total number of triangles formed by three vertices of the n-gon 
and containing one of the chosen points is equal to 


2 -2)+ 1G -D@- jt D= i) 


J=5 


Section 2.3 


1. For item (a), we leave to the reader the task of showing that ~ is symmetric. For 
the reflexivity, let a € A be given; since ¥ is a partition of A, there exists B € F 
such that a € B. Therefore, a,a € B or, which is the same, a ~ a. Now, let 
a,b,c € A be such that a ~ b and b ~ c. There exist B,C € F such that 
a,b € Bandb,c € C. However, since F is a partition of A and BNC $ @ (for, 
b € BNC), we conclude that B = C; therefore, a ~ c. Finally, for a € A, take 
the set B € F for which a € B. Then, 


a={xeA;x~as={x eA; x,aeC, IC EF}. 


Invoking once more the fact that F is a partition of A, we conclude that B is the 
only element of F to which a does belongs. Thus, 


a={xeEeA;xeEBs=B. 


The analysis of item (b) is quite similar to that of (a) and will also be left to the 
reader; we notice that, in this case, if B is a finite set, then there are exactly | B| 
distinct equivalence classes. 

2. A~ A, for A = A+0 modulo 3. If A ~ B, with B = A+ 7 modulo 3, then 
B ~ A, with A = B+ (3 — j) modulo 3. Finally, if A ~ B and B ~ C, with 
B= A-+i modulo 3 and C = B + j modulo 3, show that C = A +k modulo 
3, where 0 < k < 2 is such that i + j =k (mod 3). 

3. If the involved sets were to be chosen in a specific order, then there would be 
exactly (, ae A = a distinct ways of partitioning the set as prescribed. Since 
the choice is actually unordered, we have to divide by a!. 

4. For item (b), use the result of (a), together with Proposition 2.16 and the 
definition of Stirling numbers of second kind. For item (c), use the result of (b), 
together with that of Example 2.6. 

5. Firstly, note that there are exactly m? — m non monochromatic sequences of 
p beads, each bead being of one of the m given colors. Apply the equivalence 
relation (2.16) to such sequences (with p in place of 1) to conclude that there are 
exactly m?=" distinct and non monochromatic necklaces. Since there are exactly 
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m monochromatic necklaces, there is nothing left to do in order to establish the 
first part. For the second part, it follows from the first that p divides m? — m = 
m(m?—! — 1), for every m € N. It now suffices to assume that p { m, using the 
remarks that precede the statement of the problem to conclude that p | (m?~! — 
1). 

6. In the set F of aperiodic sequences of size n, let ~ be the relation defined 
by setting (x1,x2,...,X%n) ~ (1, y2,---+, Yn) if and only if there exists 0 < 
k < n—1 such that yj = xj+x(modn), for 1 < i < n. Show that ~ is 
an equivalence relation, with respect to which each equivalence class contains 
exactly n sequences. Then, apply the result of Proposition 2.18. 

7. Apply Bollobas’ theorem, together with the fact that (in/2 ) is the largest 
binomial number with numerator equal to n. 

8. For the first part of (a), show more generally (by induction, for instance) that if 
f™ = Id for some m € N, then f is injective, hence bijective (thanks to the 
result of Problem 4, page 7). The second part of (a) follows from the associativity 
of composition, together with the fact that f” is a bijection for every j € Z (so 


thata = f(b) > b= (f)'@) = f(a). For the first part of (b), let 


a={...,f9@, f"@, 4, f@, FP @..-3 


be an equivalence class of ~ with more than one element, so that f(a) ¥ a. If 
j > lis the smallest integer such that f(a) = a (by assumption, f(a) = a, 
thus such a j does exist), write p = jg +r, withO <r < p, and use the 
minimality of j to conclude that r = 0; then, show that 7 = p. Conclude 
that a@ = {a, f(a), f(a), ..., f?~) (@}. The second part of (b) now follows 
from the first part, together with the fact that the equivalence classes form a 
partition of J,. Item (c) follows from the fundamental principle of counting, 
together with the fact that, given pairwise distinct elements a1, 42,...dp € In, 
there are exactly (p — 1)! possibilities for the restriction of f to {a1, a2, ...dap}, 
such that f({a1,a2,...ap}) = {a1,a2,...dap} and none of aj, a2,...dp is a 
fixed point. Indeed, in this case there exists a € J, for which {a), a2,...dp} = 
{a, f(a),..., f'?~?(a)}; however, since f'?)(a) = a, the number of desired 
bijections from {a1,a2,...dp} to itself amounts to the number of circular 
permutations of these numbers, i.e., (p—1)!. Finally, for (d), once fixed an integer 
0 < k <n such that p | (n — k), there are (i) ways of choosing k elements of 
I, to be the fixed points of f. In turn, the number of ways of partitioning the 
remaining n — k elements of J, into disjoint p-element sets is equal to 


n—k n—-k—p p\_ (—k)! 
P Pp p) p/p" 
Hence, the multiplicative and additive principles assure that the number of 
functions we wish to count equals the given number. 
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9. For item (a), show that if A, B C J[, are distinct subsets with k > 2 elements 


each, then AM B VY. For item (b), take F as the family of subsets of J, formed 
by the k-element subsets of J, which contain n. For item i., fix A € F compatible 
with @; we can assume, without loss of generality, that A = {a),...,a,} and 
Oo = (aj,..., 4k, Ak+1,---,4n). Group the remaining k-element subsets of J, 
which intersect A and are compatible with o in appropriate pairs. Then, use the 
fact that F is an intersecting family to conclude that there are at most other 
k — 1 sets in F which are still compatible with o. For ii. assume, without loss 


of generality, that A = {x1,...,x,} ando = (qa,..., Qk, Qk41,.--,dy), with 
{x1,--., Xk} = {a1,..., ax}. Finally, for iii. use double counting. 

Section 2.4 

1. Let S, be the set of sequences a = (aj, ...,G)), with a; € {0, 1} for 1 <i <n, 


and P(n) be the family of parts of [,. For a € S,, let f(a) € P(n) be given by 
f(@ = {1 <i <n; a; = 1}. Now, check that the function f : S, ~ P(n) 
thus defined is a bijection that preserves metrics, in the following sense: if d is 
the Hamming metric in S, and d’ is the metric of symmetric difference in P(n), 
then d(a, b) = d'(f (a), f(b) for alla, be S). 

2. Use the hypotheses to show that T = U,<5 B(x; 3), a disjoint union, and 
|B(x;3)| = (5) + C) + (5) 4: () for every x € S. Then, conclude from 
(b) that |B(x;3) N S| = 1 for each x € 7 and, letting k = mit show that 
3. 2k-? — k(2k? — 3k +4). In order to solve such an equation, start by showing 
that 3 | k; then, write k = 3/ and conclude that / < 4 and, hence, / = 4, k = 12 
and n = 23, 

3. Adapt, to the present case, the discussion of Example 2.29 and the hint given to 
Problem 20, page 32. 

4. Let P(n) be the family of subsets of /,,, furnished with the metric of symmetric 
difference. Show that, for 1 < i < k, the balls B(A;; 1) are pairwise disjoint. 
Then, use (2.20), together with the fact that B(A,;r) U...U B(Ag; r) C P(n). 

5. Firstly, if A and B are the sets of members of two of the 1997 subcommittees, 
note that the set of people who participate of exactly one of these subcommittees 
is AAB, so that |AAB| > 5. Now, apply the result of the previous problem. 

6. Let x1,...,Xn be the sequences in the dictionary and d the Hamming metric 
(with 0 standing for big and 1 standing for small). Show that 


|B(x13 4) U... U Born | = 24. 
Moreover, for 1 <i < j <n,ifx € B(x; 4) B(x;; 4), show that d(x; xj) = 


d(x; x;) = 4 and d(x;; x;) = 8. Use this fact to show that x belongs to at most 
six of the balls B(x;; 4) and, then, conclude that 
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|B(x1; 4) U... UU BQ; 4)| < ((¢) 4 (“) 4 (3) i (7) a 1())n 
0 | 2 3) 6\4 


= 4096n. 


Section 3.1 


1. Termwise differentiate both sides of (3.2) and, then, substitute x = 1. Note that 
Theorem 3.4 justifies such a procedure. 

2. For item (a), argue as was done for the Fibonacci sequence. For items (b) and (c), 
apply the result of (a), together with (3.4). 


Section 3.2 
1. Make induction on k > 2, the initial case k = 2 being given by Proposition 3.3. 
2. Apply Theorem 3.4 k — | times, starting from — = Vino x". Alternatively, 


expand (1 — x)—* with the aid of the binomial series. 

3. For item (c), it follows from (b) and from (3.5) that g’(x) = Yani (—x)! = 
te for |x| < 1. Hence, f’(x) = g’(x) and f(0) = g(0), so that f(x) = g(x) 
for |x| < 1. 


4. For item (b), it follows from the Fundamental Theorem of Calculus that 


yi xt! = F(x) = [ F'(tdt 
0 


n+1 
= pe atdt. 
[ soa [dan t 


n>0 
n>0 


5. Start by showing that, if g > |aj|, ./|a2|, xo, where xp = 5 (a + fur + Alvl), 
then |a,| < q” for every n > 1. Fix such a q and conclude that the 


generating function )>,., axx* converges in the interval (- o 1), thus defining 


a differentiable function f : ( - Zi i) —> Rby f(@) = es axx*. Use the 
recurrence relation satisfied by the sequence (ax)x>+0 to write 
f(x) = Do agx* = ayx + ax? + Dag" 
k>1 k>3 
=ayx+ anx? + Yo (ua + vap_2)x* 
k>3 

=ayx+ anx? + ux pa ae + vx? > ax—ax*? 

k>3 k>3 


=ayx+ anx + ux(f (x) — ax) + ux? f (x). 
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Conclude that 


ax + (a2 — ua,)x2 
(1 —ax)(1 — Bx) 


for |x| < min{ 7, TE a where a and f are the roots of x? — ux — v = 0. If 


a # B, show that there exist unique real constants A and B for which f(x) = 
A + re Then, expand ce and a in geometric series to get 


f@)= 


f(x) = (Ao! + BBE) x*. 


k>1 


Finally, compare this last expression with the fact that f(x) = )°,.1 a,x* to 
obtain a, = Aa‘—! + BB*!, for every k > 1. If w = f, argue similarly. This 
problem is the content of Theorem 11.45 of [8]; further details can be found 
there. 

6. Use the formula for the number of derangements of /,, together with (3.8) and 
the triangle inequality to get 


n! 
d, — —| =n! 
e 


(-1)' 1 
zs kl ls" ge 
k>n k>n 


Then, estimate 


d ml a n! (1+ 1 us 1 ze ) 
O és I nt+2  (n+2)(n+3) 
—(1+ ae ee ) 
— acer 
n+1 n+2  (n+2) 
1 n+2 1 
I . <-. 
n+1n+1 n 


Section 3.3 


1. Letting f(x) = Dips) axx*, we get 


f@) = ax =x Sa =X Yo axsix* 


k>1 k>1 k>0 


=2x+x Ya +(k+1))x* 
k>1 


= 2x +xf (x) +x Yk + 1)x* 
k>1 


= 2x + xf (x) + xg'(x), 
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with e(4) = 345) %" = [oy for |x| < 1. Use such a relation to obtain a closed 
expression for f(x), valid for |x| < 1, expand this expression as a power series 
and obtain the desired formula for a; as a function of k. 

. For item (b), we have 


f(x) =1+ )oanx” =14+ So angix”t! 


n>1 n>0 
=|+ Y > Qan +n)x"t1 
n>0 
=142xf(x) + xc 41 9 ae 
n>0 n>0 


= 1+ 2xf (x) + g’(x) — 2g¢(x), 


with g(x) = = For (c), reduce the sum at the right hand side to a single 
denominator and, then, compare the coefficients of the numerators at both sides 
to obtain a linear system of equations in A, B and C. Finally, for item (d), use 
the result of Problem 2, page 78. 

. We want to compute the coefficient of x” in the product 


fQVS@ 4274.5... @ ba’? +..), 


with k factors x + x? +++. = 74 (for |x| < 1). 
. We want to compute the coefficient of x*° in the product 


(Lt x ta? te. )Q? +x8 $e.) txta7 tet xtx7 4-427). 
Letting f(x), for |x| < 1, denote the above expression, we have 
FQ eae ees PU ee eee’) 
=" 4a ba Gsesck es aaa 
Now, use the result of Problem (2), page 78. 
. For (a) and the first part of (b), apply the multiplicative and additive principles. 


For the coming problem, the reader may find it convenient to read again the 
paragraph that precedes Example 1.15. 


6. For item (a), start by observing that if 


aT e184 2 PI ee 9, 
a ee 


——— 
k summands 7 summands m summands 
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then n = k + 21 + 3m; conversely, any such expression for n gives rise to a 
partition of n in which no summand exceeds 3. Now, note that ifn = k+2/+3m, 


then x” = x*. x?/. x3”, and hence (setting ay = 1) 


eee Oe Ole OL 


n>1 k>0 1>0 mz0 


1 1 1 


if |x| < 1. Item (b) can be solved by multiplying both sides by (1—x)(1 wd 
x>), expanding and comparing coefficients of equal powers of x; the answers are 
a=%b=cH= i andd = ;. Finally, for item (c), insert the values of a, b, 
c and d in the right hand side of the equality of item (b) and use (3.1), together 
with the result of Problem 2, page 78, to get 


7 Sox, i 2 yo, 


k>0 — k>0 


ip E (2) he Deve 


n>0 n>0 


Then, look separately at each of the cases 6 | n, 2 | n but 3{n, 2 { n but 3 | n, 
2{nand3{n. 
6. Letting dy) = d, = 0, item (d) follows from 


! 
Pe 1)! 
(n — 1)dn-1 n—1 (n — 1)dn—2 n—1 
= mi ap) = 
n>2 n>2 
(n — 1)dy-1 yn-2 a) n—2 
ee "Gap —1)! $8 a 


=x +xf'(x) +xf (2). 


Item (e) follows the fact that eae = £ log ( f@)+ ll) Finally, for (f) integrate 
the equality in (e). 

8. For item (a), the number of ways of choosing the 2ao positions of the 0’s, the a1 
positions of the 1’s etc is equal to 


n n — 2ag\ (n — 2a9 — a1 \ (n — 2a9 — ay — a2 _ n! 
2a0 a a2 a ~ (2ag)!aq!a2!a3!" 
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Item (b) amounts to noticing that 


"x i x” 
= 2 Catal Satara 


eo n>1 


4 ae 1 x x 3 
a te a) 
For (c), start by observing that 
¥ e x (—x)* _ x2J 
ee = S(qt kl ) a 
k>0 
so that 
1 x —x x \3 1 4x 2x 
fa) =5[e +e )(e*) =e" +e de 


Finally, expand e** and e?* in power series and compare the result with S 
to finish the problem. 


n>0 a 


Section 4.1 


1. Divide the given square into four small squares of side | each, and let them 
play the role of pigeonholes. Then, notice that the length of the diagonals of 
each small square is precisely V2. 

2. Divide the cube into 9? = 729 small cubes, each of which having edge length 
equal to ». Such small cubes will be the pigeonholes, and have diagonals of 


length - with ie <2. 


3. Let the pigeonholes be the intervals [ i Et 


n 


) , with 0 < / <n, and the pigeons 


be the feeuoneyy parts! of the numbers kx, with 1 < k < n — 1. Note that if 


{kx} € [0, =) U [> , 1), for some integer 1 < k < n — 1, then there is nothing 
left to do. Else, A € Ler [, Ht) for 1 < k < n —1, and it suffices to 


apply the pigeonhole principle. 


‘Recall that the fractionary part of x € R is the real number {x}, given by {x} = x — |x]. In 
particular, note that {x} € [0, 1), with {x} = 0 if and only if x € Z. 


22 
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. By contraposition, if aj <m,a2 <m,..., a, <m, then 


ai +d2+-::-+ay, 2) 


I 
= 


n n 


The other case is entirely analogous. 


. Let us argue as in the previous problem: since each number is a summand of 


five sums of five consecutive numbers (around the circle), the total sum of all 
possible sums of five consecutive numbers equals 


1+ 15)-15 
ae 5=40-15. 
2 
Since there are 15 sums of five consecutive numbers, the average value of each 
one of the is ue = 40. Therefore, by the result of the previous problem, at 


least one of these 15 sums is greater than or equal to 40. 


. Apply the result of Problem 4, counting the total number of color matchings of 


corresponding sectors in disks A and B, in each of the 200 possible matching 
positions. 


. Consider the 51 sets {0}, {1,99}, {2,98}, ..., {49,51}, {50}. Since there 


are exactly 100 possible remainders upon division of an integer by 100 (the 
remainder varies from 0 to 99), the pigeonhole principle guarantees that at least 
two of the 52 given numbers have remainders belonging to a single one of these 
51 sets. Therefore, either their sum or their difference leaves remainder 0 upon 
division by 100. 


. Letting n be a given natural number, take as pigeons the n + 1 numbers 1, 11, 


111,..., 11... 1 (this last one with exactly n + 1 digits 1). Let the pigeonholes 
be the remainders upon division by n, i.e., 0, 1,..., — 1. 


. Given a natural number n, adapt the hint given to the previous problem to show, 


with the aid of the pigeonhole principle, that n has multiples of the forms a = 
33...300...0andb = 77...700...0. Then, show that it is possible to choose 
m € N such that either a + 10D or 10” a + b has odd sum of digits. 
Condition gcd(a, n) = | assures that none of the given powers of a is divisible 
by n. Thus, we can distribute them in boxes labelled from | to n — 1, according 
to their remainders upon division by n. If box 1 does contain at least one of the 
given powers, there is nothing left to do; otherwise, the n — 1 given powers of 
a shall be allocated among n — 2 boxes with labels varying from 2 ton — 1. By 
pigeonhole’s principle, there exists 2 < k < n — 1 such that the box labelled k 
contains at least two distinct powers of a, say a anda/, with 1 <i < j<n-t. 
Then, we have a! =ns +k anda/ =nt +k, for some s,t € Z, and it follows 
that 


ai(ai— -l= ai-adi= n(t —S). 


Therefore, n | a'(aJ—! — 1) and, since gcd(n, a') = 1, we conclude that n | 
(a/—! — 1). 
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12. 


13. 


14. 


15. 


16. 


17. 


22 Hints and Solutions 


If one of the n sums a}, aj + a2, qj + a2 + 43,...,a, +d. +--+-+ ay leaves 
remainder 0 when divided by 1, there is nothing else to do. Otherwise, distribute 
them into boxes numbered from | to n — 1, according to their remainders upon 
division by n. Since we have n sums and n — 1 boxes, pigeonhole’s principle 
guarantees that at least two sums will be in a single box. Letting aj + ---+ ag 
and a; +---+ag+---+ a) be these two sums, conclude that ag +ag41 +--+ -+a) 
is a multiple of n. 

Notice that each element of Jo, is of the form aka, with k > Oandgq e€ 
{1,3,5,...,2n — 1}. Then, apply pigeonhole’s principle, with the pigeonholes 
being the n odd numbers 1, 3, 5, ..., 2n — 1 and the pigeons being the n + 1 
selected elements of Joy. 

Each element u ¢€ A is of the form u = 2%!3%25°37%411%, Take, as 
pigeonholes, all possible sequences (G1, @2, @3, @4, @5), with aj = O or | for 
1 <i <5. Then, place u € A into the pigeonhole (a1, @2, @3, @4, a5) if and 
only if a; and a; have equal parities, for 1 <i <5. 

By contraposition, assume that none of the 15 chosen elements is a prime. Then, 
Corollary 6.36 guarantees that each one of these 15 numbers has a prime divisor 
which is less than or equal to 1998 ~ 44,69. Now, notice that there are 
exactly 14 primes less than or equal to 44, namely: 2, 3, 5, 7, 11, 13, 17, 19, 
23, 29, 31, 37, 41 and 43. Since we have chosen 15 numbers, the pigeonhole 
principle assures that at least two of these will be divisible by a single one of 
these 14 primes and, this way, will not be relatively prime. 

On the one hand, A has 2!°—1 = 1023 nonempty subsets. On the other, the sum 
of the elements of any such subset of A is at most 90+91+92+---+99 = 945, 
Hence, by the pigeonhole principle, at least two nonempty subsets of A, say B 
and C, have equal sums of elements. Thus, B \ (BC) and C \ (BMC) are 
the subsets of A we are looking for. 

If A c {1, 2,3,...,2n+ 1} is free of sums, we claim that |A| < n + 1. Indeed, 
letting A = {1,3,5,...,2n+1}, we have |A| = +1 and A is free of sums, for 
the sum of any two (not necessarily distinct) elements of A is an even number, 
while all elements of A are odd. Now, let B C A be such that |B] =n + 2; we 
will show that B is not free of sums. If B = {x1 < x2 <--+ < Xn+2}, then 


1 < Xn42 — Xn41 < +++ < Xn42 —X1 XS 2n. 


If none of these n + 1 differences x,42 — x; belongs to B, then all of it shall 
belong to {1,2,3,...,2” + 1} \ B. However, since this last set has only n 
elements, pigeonhole’s principle assures that at least two of the numbers x42 — 
x; are equal. This is a contradiction 

We first claim that, among the n + | chosen numbers, there are necessarily two, 
say x and y, such that |x — y| < 2. In order to verify such a claim, it suffices to 
apply the pigeonhole principle, distributing the n + 1 chosen elements among 
the n sets {3k—2, 3k—1, 3k}, with 1 < k < n. Now, letting x and y be as above, 
assume, without loss of generality, that y > x. If y—x = 1, then4xy+1 = 
4x(x+1)+1 = (Qx+1)*. If y—x = 2, thenxyt+1 = x(v+2)41 = (x41). 
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18. We shall show that f (7) = 2n. To this end, let 
{1,2,...,2n}=CyUCoU---UC, 


be an arbitrary of If(n) into n sets. Since from n to 2n there are n + 1 integers, 
the pigeonhole principle guarantees the existence of n < u < v < 2n and 
1 <i <n such that u, v € C;. Taking a = 2u—v > Oandx = y=v—-u>1 
we have 


fatx,aty,a+x+y}= {u,v} CG; 


therefore, f(n) < 2n. Now, consider the following partition of y,—; into n 
sets: 


n—1 
{1,2,...,2n— l= (tk, k +n} U {n}; 
k=1 
we shall show that the stated conditions are not fulfilled. Indeed, if 


fa+x,aty,atx+ y}= {u,v} C Cy = tk, k +n}, 


thna+x=a+y=kanda+x+y=k-+n,s0 thatx =n anda = 


k —x = k-—n < 0, which is an absurd. Finally, it is enough to see that 
{a+x,a+y,a+x-+y} has at least two elements and, hence, cannot be equal 
to {n}. 


19. For item (a), assume that the 21 squares of the chessboard have been painted. 
Call a column black if it has at least two black squares, and white if it has at 
least two white squares. Since we have seven columns, the pigeonhole principle 
assures that at least four of them are of the same color, say (without loss of 
generality) black. We can also assume that these are the first four columns 
of the chessboard, from left to right. If any of these four columns has three 
black squares, there is nothing left to do: it suffices to take two black squares of 
another columns to get a rectangle with four black squares in the corners (see 
Fig. 22.1). 

Otherwise, in each of these four black columns there are exactly 2 black 
squares. Since in a single column there are exactly three ways of choosing these 
two black squares, the pigeonhole principle assures that at least two of the four 


Fig. 22.1 The first case of 
item (a) 
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Fig. 22.2 The second case of 
item (a) 


Fig. 22.3. A painting for item 
(b) 


columns will have two black squares in the same positions. These two pairs 
of black squares in the two columns are the corner squares of a rectangle with 
sides parallel to those of the chessboard (see Fig. 22.2). 

For item (b), the total number of ways of choosing 2 out of 4 squares is C) = 
6. Thus, we have to paint black exactly two squares in each of the six columns, 
choosing the two black squares at the columns in all six possible distinct ways 
(for instance, squares | and 2 in the first column, | and 3 in the second, 1 and 
4 in the third, 2 and 3 in the fourth, 2 and 4 in the fifth and 3 and 4 in the 
sixth—see Fig. 22.3). 

20. Choose any person, say A; by the pigeonhole principle, A exchange letters on a 
single subject with at least |) + 1 = 6 other people. Now, looking at these 
six people, there are two cases to consider: either there are two of them which 
correspond on the same subject as that of the letters they received/wrote to A— 
and in this case there is nothing left to do—or any two of them correspond on 
another one of the two left subjects. In this last case, repeat the above reasoning, 
applying the pigeonhole principle to these six people, with any two of them 
exchanging letters on one of two possible subjects. 


Section 4.2 


1. Start by observing that if a square can be partitioned into k other squares, then 
it can also be partitioned into k + 3 squares. Then, show how to partitionate a 
square into 6, 7 or 8 squares. 

2. Label the rows and columns of the chessboard from top to bottom and from left 
to right, from | to n, and let a;; denote the number written in the (7, j)-cell (i.e., 
that situated in row i and column /). If S; stands for the sum of the numbers 
written in the k? cells of ak x k chessboard, show that 
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Sn = Sn—2 + Gnn — An—1,n—-1- 


Then, apply an inductive argument. 


. Give a convex polygon A, A2...A;Ax-+1, its interior is equal to the union of 


the interiors of the convex polygons A, Az... Aj; ...AxAx+1, where the” over 
Aj means that such a vertex is excluded. Hence, O belongs to one of these 
convex k-gons, say A; A2... Ax. Now, insert an induction hypothesis and show 
directly that, for some 1 < i < k, at least one of the angles A; O Ag+ is not 
acute. 


. Make complete induction on the number k of sides of the polygon. For the 


induction step, consider a polygon P of n vertices, and draw some of its 
diagonals in such a way that no two of them intersect in the interior of P. 
Assume, without loss of generality, that these diagonals partitionate P into 
triangles. Fix one of the traced diagonals, say AB, and let P; and P2 be the 
two polygons into which P gets divided by AB. Use the induction hypothesis 
to show that P; and P> possess a vertex different from A and B, which is not 
incident to none of the drawn diagonals. 


. By induction hypothesis, suppose we can paint k towers as described, for a 


certain k < n”. Place k+1 towers in the chessboard, discard the left and upmost 
tower and apply the induction hypothesis to the k remaining towers. Then, paint 
the discarded tower with a color (out of the three given ones) distinct from the 
color(s) used to paint the towers (if any) which are closest to and belong to the 
same line or column of the discarded one. 


. Suppose that the given property is true for k lines, and draw the (k + 1)-th, say 


r. Choose one of the semiplanes into which r divides the plane and maintain 
the colors of all regions situated in this semiplane (even those resulting from a 
division of a former region in two by r); then, change the colors of all regions 
contained in the opposite semiplane (again, even those coming from a division 
of a former region in two by r). 


. Make induction on the number of columns of the board (independently of the 


total number of lines). To this end, say that the chessboard is é blue (resp. red) 
if the queen can visit all of its blue (resp. red) squares as prescribed in the 
statement of the problem. For the induction step, apply the induction hypothesis 
to the m x n board obtained from an m x (n + 1) one by the exclusion of 
its rightmost column. Upon doing this, consider three separate cases: (i) the 
smaller board is blue but nor red; (ii) the smaller board is red but not blue; (111) 
the smaller board is at the same time blue and red. In case (i), if the erased 
column has any blue square, then at least one of the remaining squares of the 
line of this blue square is also blue; show that, this being the case, the bigger 
chessboard is also blue. In case (ii), argue in an analogous way. Finally, in case 
(iii), start by observing that we can assume that if any square of the erased 
column is blue (resp. red), then the remaining squares of the same line of 
this blue (resp. red) square are all red (resp. blue). Let the queen walk along 
a diagonal of the board to show that it still is blue and red. 
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For item (a), let r be a line passing through two of the n given points, say A 
and B, and which is at a minimum positive distance from one of the remaining 
points, say C. Argue by contradiction to show that, from the n given points, A 
and B are the only ones situated on r. For item (b), use induction. The initial 
case is n = 3, while the induction step is taken care of by the result of (a). 
Make complete induction on n. For the induction step, assume the desired 
property to be true for 1 < n < k, with k e€ N, and consider 2k + 2 
points satisfying the given hypotheses. Let P be the smallest convex polygon 
containing such points (so that the vertices of P are some of these points). 
Consider two cases separately: 


(i) There are two consecutive vertices A and B of P such that A is blue and B 
is red: draw AB and apply the induction hypothesis. 

(ii) All vertices of P are monochromatic, say red: take an edge of P and call it 
B, Bz. Number the blue vertices Aj, A2,..., Ag, Ag+1, Such that 


BoB A, < BoB, A> << BB Agy1. 
Define a function f : {1,2,...,4 +1} — N by letting 
f@ = #(pontos vermelhos no interior de Bz By, A;) + 2, 
so that 
2270S I QVEVMSfOs [et V Sk 


(note that the last inequality comes from the fact that P has at least three 
vertices, at least one of which is not interior to 2 Bz B,Ax+1). Conclude 
that there exists 2 < i < k such that f(i) = i, trace B, A; and apply the 
induction hypothesis. 


Let us make induction on n. For n = | we can take kj = 3. Suppose the 
conclusion to be true for n = 1, and consider a set of m points in the space, 
no four of which being coplanar and each line segment joining two of them 
having one of the numbers 1, 2, ...,/,/ + 1 associated to it. Fix one of these m 
points, say A, and look at the m — | line segments joining A to the remaining 
m — 1 points. Since each one of these m — | segments has associated to it one 
of / + 1 given numbers, the pigeonhole principle assures that, out of these / + 1 
numbers, a certain number x is associated to at least 


m= |S |-1 
eT 


distinct segments. If M > kj, we can choose points A}, Az, ..., Ax, such that 
the line segments AA; (1 <i < k;) have x associated to themselves. We now 
look at the line segments joining two of the points Aj, Az, ..., Ax,. If one 
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of them, say A;A;, also has x associated to itself, there is nothing left to do: 
AA;Aj; will be a triangle with x associated to all of its sides. Otherwise, points 
Aj, A2,..., Ax will be a set of k; points in space, no four of which being 
coplanar and with one of the / numbers 1, ...,x —1,x+1,...,/+ 1 associated 
to each of the line segments joining two of them. By induction hypothesis, we 
get the existence of a triangle having the desired properties. In order to finish, 
it suffices to note that M > kj ifm > (1+ 1)(k) — 1) +2. 


Section 4.3 


1. In each item from (a) to (d) one has to check that the corresponding relation is 
reflexive, antisymmetric and transitive. For item (a), we already know that a = a; 
a < bandb < aimply a = b;a < bandb < cimplya < c. The verification 
for (b) is quite similar, and we just point out that if Y and Z are subsets of X 
satisfying Y C Zand Z C Y, then Y = Z. We leave item (c) as an equally 
easy exercise for the reader, referring to items (a) and (b) of Proposition 6.5 
(if he/she finds it necessary) for complete arguments. Finally, for item (d), the 
desired properties are inherited from those of < in A. 

2. Apply Mirsky’s theorem to A, partially ordered by the divisibility relation. 

3. If (A, ) is a partially ordered set such that |A| = ab +1, witha,b € N, 
and c = min{a, b}, then A contains a chain of c + 1 elements. The proof is an 
easy adaptation of the proof of Theorem 4.25. It starts by letting n = ab + 1, 
A = {x1,...,X,} and A= {(i, x3); 1 < i <n}, ordered by <,, where 


@, xi) $1 Gs xj) & @ = J) or @ < j and x; X xj). 


The rest of the proof goes on almost unchanged. 

4. Make induction on m, first noticing that the case m = | corresponds to the case 
a = b in the Erdés-Szekeres theorem. For the induction step, write n" +1as 
(n2)2"" + 1 and apply the Erdés-Szekeres theorem again. 

5. Let x € AN B and assume, by the sake of contradiction, that A and B have 
distinct lengths, with B being the longest. Let Aj = {y € A; y< x}, A2={y € 
A; x x y}, By = {z € Bs zx x}, Bo = {z € B; x X z}. Show that A, U Bo and 
Az U B, are chains, and at least one of then is longer than A. 

6. For item (a), assume that A; N Bp = @ for some index 1 < i < k, apply 
the pigeonhole principle and arrive at a contradiction to the fact that Bo is 
an antichain. For item (b), start by letting B; be an antichain of k elements 
containing x;. If j ¢ i, show that Aj; 1 B; A @, take y € A; M B; and 
successively conclude that y x x; and x; 4 x;. Now, reverse the roles of i 
and j to show that x; 4 x;. For item (c), i., start by observing that B; \ {x;} is 
an antichain of k — 1 elements in A \ C. On the other hand, if A \ C contained 
an antichain B of k elements, then B would intersect {z € A;; x; ~ z}, which 
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contradicts the definition of x;. For (c), ii., the induction hypothesis guarantees 
that A\C = Cy U...UCg_-1, with Ci, ..., Cg_1 being pairwise disjoint chains; 
since C is itself a chain, we thus obtain A = C; U...U Cy_1; UC, a union of 
k pairwise disjoint chains. Finally, for (d), i. follows from the hypothesis of (d), 
together with the fact that a is a maximal element. Subitem ii. is straightforward: 
A= {a}UA,U...UAg. 


Section 4.4 


1. No. On the one hand, one can easily prove that, after any number of operations, 
each bottle will contain an amount of water expressed in liters by a fraction of 
the form &, with a, k € N. On the other hand, if one could reach the situation 
in which every bottle has the same amount of water, then each one of them 
would contain 4 1 of water. 

2. Show that the trinomial’s discriminant is an invariant for the allowed operations. 

3. No! To see why, first look at the chessboard with its usual alternate black and 
white painting. Since each 2 x 1 rectangle covers exactly one black and one 
white square, after we have distributed any number of rectangular pieces, these 
will have covered equal numbers of black and white squares (this is the invariant 
that will solve the problem). Nevertheless, when we cut out two opposite 
squares, the remaining 62 squares of the chessboard comprise 32 squares of 
one color and 30 of the other. Hence, the cut out chessboard cannot be covered 
as asked. 

4. Ifat some moment we replace a and b by |a—b|, then the difference between the 
sums of the numbers written on the blackboard before and after the operation is 
(a + b) — |a — b|, which is always even. Therefore, the parity of the sum of the 
numbers written on the blackboard at any moment is an invariant with respect 
to the allowed operations. Now, observe that 1 + 2+ ---+ 1997 = 999. 1999, 
which is odd. 

5. If S and S’ denote the sum of the numbers of two consecutive lists of six 
numbers, check that S’ = S (mod7). Then, note that 1+2+3+44+5+6= 21 
and3+7+2+4 1548-48 = 43, but 21 4 43 (mod7). 

6. Label the coins consecutively from 1 to 10. Then, associate to each coin a 
weight, which is 0 if the coin is showing head, or is equal to the label of the 
coin, if it shows tail. Then, verify that each move does not change the parity of 
the sum of the weights associated to the coins. 

7. Firstly, distribute the members in the two houses at random, and let S denote 
the sum of the numbers of enemies that each member has in the house he/she is 
in. Now, choose a member and look at his/her number of enemies within his/her 
present house; if this is 2, then move him/her to the other house and note that 
this operation diminishes the value of S. Now, repeat this as long as possible. 

8. Note thata + b+ab+1= (a+ 1)(b+ 1). Therefore, if at a certain moment 
the numbers written on the blackboard are x), x2, ..., xz, then the value of the 
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expression (x; + 1)(x2+1)...(x% +1) does not change. In the beginning, such 
a value was (1+ 1)(24+ 1)...@+1) = (1+ I)!. Hence, letting x denote 
the number written on the blackboard after n — | operations, we have x + | = 
(n+ 1)!, so that x = (n+ 1)! — 1. 


. No! In order to prove this, let a, b, c, d and a’, b’, c’, d’, respectively, denote 


the quantities of cards at the vertices of the square (in the clockwise sense), 
immediately before and after some step. If f(w, x, y,z) =w—x+y—z, itis 
immediate to check that f(a’, b’, c’, d’) — f(a, b,c, d) = 3k, where k is the 
number of cards removed from one of the vertices at that step. In particular, 


f(a',b’, c,d’) = f(a, b,c, d) (mod 3). 


Thus, the remainder of f(w, x, y, z) upon division by 3 is an invariant asso- 
ciated to the allowed operations. Now, if there were a sequence of operations 
leading from the first to the second configuration, we should have 


fd, 9, 8,9) = fd, 0,0,0) or fd, 9, 8,9) = FO, 1, 0, 0) (mod 3), 


which is an absurd. 

Start by drawing n line segments satisfying condition (a) and with distinct 
endpoints. Then, let Aj and A> be blue points and V; and V2 be red ones, such 
that A; Vj and A2V> are two of the drawn line segments. If A; Vj N A2V2 4 G, 
erase them and draw Aj V2 and A2 Vj instead. Now, use the (geometric) triangle 
inequality to show that the sum of the lengths of the n drawn line segments 
decreased. Finally, use the finiteness of the set of points, together with the semi- 
invariant consisting of the sum of the lengths of the n line segments, to show 
that, at some moment, condition (b) will be satisfied too. 

A will do. To see why, let us paint the unit cubes black and white, so that 
adjacent cubes have different colors and each player moves from a unit cube of 
one color to another one of the other color. We consider two cases: e n is even: 
group the unit cubes into n>/2 blocks 1 x 1 x 2 and make A start in a white 
cube, always moving to the black cube of the same | x | x 2 block. en is odd: 
assume, without loss of generality, that the unit cubes placed at the corners of 
the bigger one are black. Ignore one of these black unit cubes and group the 
remaining n> — 1 ones into (n* — 1)/2 blocks 1 x 1 x 2; again, A should start 
in the white cube of a block and follow the strategy of the first case. 

For item (a), number the unit squares of the chessboard, modulo 4, as shown 
in the table below, noticing that there are 16 squares with each integer from 0 
to 3. Any piece of one of the displayed shapes (and this is the invariant!) will 
occupy four unit squares such that the sum of the numbers written on them is a 
multiple of 4. 
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10321032 
23012301 
32103210 
01230123 
10321032 
23012301 
32103210 
01230123 


For item (b), take five unit squares of the chessboard, placed as shown in the 
figure below: 


If it was possible to number the squares as desired, we would have, modulo 4, 
a+b+ct+d=at+b+d+ezat+b+cte=dt+ct+d-e. 


Hence, it should be the case that a = c = d = e (mod 4). Check that this would 
give 30 squares of the chessboard with numbers all congruent modulo 4, which 
is impossible. 

Whatever the way of placing the cubes such that the faces touching the table 
form the board of a 6 x 6 square, each cube will have 3 visible and 3 invisible 
faces. For any such placement of the cubes, let vg and vw respectively denote 
the numbers of black and white visible faces. Since 


lug — vw| = vg — vw = vg + vw = 60 = 0 (mod 2), 


we conclude that |vg — vw| is always even (here we have the invariant!). 
Assume, without loss of generality, that vw > vg, and hence that vw > 31 
and vg < 29. Show that there is at least one cube with more invisible than 
visible black faces, and then that it is possible to change the position of this 
cube in such a way that |ug — vw| decreases two units. 

Note that, after any number of operations, the quantity, say n, of written 
numbers does not change. On the other hand, thanks to the property we alluded 
to before the statement of the problem, the product P of the written numbers 
also does not change. Since there is just a finite number of ways of representing 
P as a product of n positive integers, there is a largest positive integer m that 
appears in the blackboard at some moment. We claim that such a number 
is a multiple of all the other ones written at this moment. For the sake of 
contradiction, assume that, at this moment, some number k written on the 
blackboard does not divide m. Then Icm (m,k) > m, so that changing m and 
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k by gcd(m, k) and lcm (m, k), we conclude that m is not the largest possible 
number to appear on the blackboard. This is an absurd! Thus, by this claim, 
when m appears on the blackboard, the written numbers are m1, m2, ..., ™n—1 
and my, = m, with m1, m2,..., Mp,—, dividing m. Now, show that m will always 
be on the blackboard, so that, for all purposes, the subsequent operations go on 
as if we were operating only with m, ..., my,—1. Then, we can use the same 
argument as above with these n — | numbers, thus concluding that, after a finite 
number of operations, we shall reach a situation in which the written numbers 
X1,.X2,..., Xn are such that x1 | x2 | --+Xn, | Xn. From this time on, nothing 
changes on the blackboard. 


Section 5.1 


1. Consider a graph whose vertices are the 100 guests and in which two vertices 
are adjacent if and only if the corresponding guests know each other. Then, 
apply the result of Corollary 5.6. 

2. Look at the possible degrees of a vertex of a graph with n vertices and, then, 
apply the pigeonhole principle. You may find it useful to review the discussion 
of Example 4.2. 

3. Assume, without loss of generality, that V; = V2 = I,, for some n € N. If 
f : I, — I; is an isomorphism between G, and G2, then f is a permutation 
of J,; show that, applying f to the rows of Adj(G,), we get Adj(G2). Argue 
similarly for the converse. 

4. Let Gy = (Vi; £1), Go = (V2; Er) and G3 = (V3; E3) be given graphs. If 
ff: Vi — Vo (resp. g : V2 — V3) is a graph isomorphism between G, and G2 
(resp. between G2 and G3), show that | a : Vo —> V, (resp. go ff: Vi > V3) 
is a graph isomorphism between G2 and G, (resp. between G ; and G3). For 
reflexivity, use the identity, Id: V; > Vj. 

5. If V = {uy,...,un} and G = (V; E) is a labelled graph, then E C P2(V); 
conversely, given E C P2(V), we obtain a labelled graph G = (V; E). Hence, 
it suffices to count the number of distinct subsets E of P2(V), which is 2G), 

6. Assume that such a graph, say G, does exist, and let a and b be its vertices of 
degree | and c its vertex of degree 6. Since the graph has seven vertices, c must 
be adjacent to both a and b. Therefore, if H is the graph obtained from G by 
the excision of a and b, then H has five vertices, of degrees 2, 3, 4, 5 and 4 
(recall that c has lost two of its edges). But this is an absurd, for in H we have a 
vertex of degree 5 which can have at most four neighbors. Alternatively, apply 
Havel-Hakimi’s algorithm. 

7. Try to build the graph by imposing that the vertex of degree 3 is not adjacent 
to that of degree 6. Then, observe that the vertex of degree 6 must be adjacent 
to those of degrees 1, 1, 2, 4, 4 and 5. Alternatively, apply Havel-Hakimi’s 
algorithm. 


562 


8. 


10. 


11. 


12. 


13. 


22 Hints and Solutions 


Argue by induction on n. For the induction step, if G is a graph of vertices 
Uj, VI, ..., Uk], Ue_1, With dg(u;) = dg(v;) = i for 1 <i < k—1,adda 
vertex u to G and make it adjacent to uj, ..., ug—1; then, add a vertex v and 
make it adjacent to uw. 


. Item (a) follows from the fact that, for given u,v € V, we have uv € E or 


uv € E°; hence, Ng(u) 1 Ng(u) = B and Ng(u) U Neu) = V \ {u}, so that 
dgtu) + dg(u) = #(NG(u) U NGtu)) = #V \ {u}) =n —- 1. 


For item (b), it suffices to observe that EM ES = @ and E U E* = P2(V), so 
that 


n 
|E| + |E°| = |EU E‘| = |P2(V)| = (3). 


Letting G = (V; E) be a self-complementary graph, we have |E| = |E‘|. 
Hence, item (b) of the previous problem gives 


sini) 22 


so that 4 | n(n — 1). The rest is immediate. 

G—B=(V\B; E’), where E’ is the set of edges of G which are not incident 
to any vertex in B.G—A=(V; E\ A). 

For item (a), note firstly that both G — u and Gjy(G)\ {uj} have V(G) \ {u} as set 
of vertices. Now, if € is an edge of G, then € is an edge of G — u if and only if € 
is not incident to u, which is the same as asking that € be an edge of Giy(G)\ {u}. 
Finally, the proof of (b) is analogous to that of (a), once w apply the discussion 
of the previous problem. 

The only vertices of G which change degree in H are u and v, and they do so 
in such a way that dy (u) = dg(u) + 1 and dy(v) = dg(v) + 1. Now, there are 
three essentially distinct possibilities: (i) u and v have odd degrees in G: then, 
u and v have even degrees in H, so that H has two odd degree vertices less than 
G; (ii) u and v have even degrees in G: then, u and v have odd degrees in H, 
so that H has two vertices of odd degree more than G; (iii) u has even degree 
and v has odd degree in G: then, u has odd degree and v has even degree in H, 
so that H has exactly the same number of vertices of odd degree as G. For the 
second part, let us start with a graph G = (V; E), with E = {€1,..., €m}. For 
O0<k<~m, let Gg = (V; Ex), with Ex = {€441,..., €m} forO < k < m, and 
Em = @. If vg denotes the number of vertices of odd degree in Gz, then v,, = 0 
and, hence, 


m-1 m—1 


vo = IC: = Ue41) + Um = So (ve — Uk+1). 


k=0 k=0 
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However, by the first part of the problem, we have vg — vg41 = —2, 0 or 2, so 
that vy — vz¢41 is always even. Therefore, vp, as a sum of even summands, is 
also even. 

Item (a) will be left to the reader. For item (b), if m = 2n and 1 <i, j < 2n, 
then 


{i, j}<€ ES j =i +n(mod2n). 


From this, conclude that vertex i + (modulo 27) is the only neighbor of vertex 
i; also, the edges of G(2n,n) are those joining vertices i andi +n, for 1 < 
i <n, so that there are exactly n edges. For item (c), given 1 <i < m, 
show that vertex i is adjacent to vertices i + n and i — n (modulo m) and that 
i+n 4¢i-—n (modulo m), thus giving d(i) = 2; for what is left to do, use 
Euler’s theorem to conclude that |E| = m. Finally, in what concerns item (d), 
show that Id : I, —> Im is an isomorphism between G(m,n) and Gm, m—n). 
Verify that the desired graph is the one depicted in Fig. 5.4. 

Firstly, note that the very definition of bipartite graph assures that, for u, v € V, 
we have uv € E > u € Vj and v € Vo, or vice-versa. Hence, we can identify 
E with a subset of V; x V2 through the injection uv b (u,v), ifu € Vj and 
v € V). Therefore, |E| < |Vi x V2] = |Vi| - | V2], with equality if and only if 
E=YV, x Vo. 

If |Vi| = |Wi| and |V2| = |W], take bijections f; : Vj > W, and f2 : V2 > 
W2; then, let f : Vi U V2 > Wy, U Wy) be given by setting fiy, = fi and 
fiv. = f2, and show that f is an incidence-preserving bijection. Conversely, 
let f : Vij U V2 — W, U Wy) be an incidence-preserving bijection. Since Vj is 
an independent set, the same happens to f (V1). Hence, f (Vj) cannot intersect 
both W, and W2, so that f(V,) C W, or f(V1) C Wo. Assume that f(V1) C 
W, (the other case is entirely analogous). Since every vertex in V2 is adjacent to 
every vertex in Vj, we conclude that every vertex in f (V2) is adjacent to every 
vertex in f (V1) C Wj; then, it follows from the fact that W; is an independent 
set that f(V2) does not intersect W), i.e., f(V2) C W2. Now, 


IVi U Val =|f(1 U V2) = | FV + AS (2)I 
< |Wi| + |W] = |W1 U Wo] = [V2 U Val, 


with the last equality coming from the fact that f is an isomorphism. Therefore, 
|f(Vi)| = [Wil and | f(V2)| = |W, so that f(Vi) = Wi and f(V2) = Wo. 
Finally, this gives |Vj| = |W1| and |V2| = |W2|. 

Assume that G has / vertices of degree 5 in V; and k vertices of degree 8, so that 
|Vi| = k+1 and |V2| = 16 —/. Since the graph is bipartite, the edges departing 
from V, are incident to vertices in V2, and vice-versa. Hence, 8k+5/ = 5(16—/) 
or, which is the same, 4k + 5/ = 40. It is now immediate to verify that the 
nonnegative integer solutions of such an equation, with k > 0, are (k,/) = 
(5, 4) or (10, 0). In each one of these cases, exhibit the corresponding bipartite 
graph. 
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For item (a), apply Euler’s theorem. For (b), start by letting the n vertices be 
those of a regular n-gon, and consider the two following cases: (i) ifr = 2m < 
n, join each vertex to the 2m closest ones; (11) if r = 2m + 1 <n (so that n is 
even), join each vertex to the 2m closest ones, as well as to the vertex symmetric 
with respect to the center of the polygon. 

Let P}, Po,..., Pj; be the patrols and h1, hz, ..., hy be the volunteers, and join 
h; to P; if h; is a member of P;. Also, let k be the number of members of each 
patrol and A; be the set of men in patrol P;. Since we have n men altogether, it 
follows that 


n= |A,UA2U---UA}]|. 
Therefore, by inclusion-exclusion, 


n = |A,;UA2U---UAq]| 


=A - ain Ayl+ YO aN Ay Ag — 
i 


i<j i<j<k 


However, since no man belongs to three or more patrols, the only nonzero 
summands in the last expression above are the two first ones, so that 


ll 
n= Dail — D1ainajl= uk ({)) = ks. 
i 


i<j 


Now, let us count the number of edges of our (bipartite) graph in two distinct 
ways: on the one hand, since each man has degree 2, we have a total of 2n 
edges; on the other hand, since each patrol has degree k, the number of edges 
is equal to 11k. Thus, 2n = 11k. Finally, solving the linear system of equations 


n= 11k—55 
2n = IIk. , 


we getn = 55 andk = 10. 

For item (a), adapt the reasoning used to solve the previous problem to get 
g =m—1andn = ('). For (b), set Vi = {a1, a2,..., am}, V2 = {bj.e3 1 < 
j <k < m} and let a; to be adjacent to b;,, if and only ifi € {j, k}. 

Look at the posed situation as a bipartite graph G, with the elements of the 
independent sets of vertices being the students and the problems, and such that a 
student and a problem are adjacent if and only if the student solved the problem. 
If each problem was solved by k students, apply double counting to conclude 
that k = x Now, let E be the set of students, e € E and P be the set of the y 
problems solved by e. If H is the subgraph of G induced by (E \ {e}) U P (cf. 
Problem 12), then H is also bipartite; apply double counting to H to conclude 
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that 3(x — 1) = y (5 _ 1). Then, deduce that (x, y) = (4, 9) or (8, 7). Finally, 
bipartite graphs corresponding to those possibilities can be easily constructed 
from the respective adjacency matrices A, which we exhibit below: 


(i) (x, y) = (4,9): 


111111111000000000 
OOD0000I1TILI1111000 
OOOLLIOOOLLIOOOIII 
111000000000111111 


(i) (, y) = (8, 7): 


11111110000000 
OOD0O01LIL11I110001 
OOLLIOOLOOIIOL 
11001000111100 
10100100101011 
10010101010110 
O01100011011010 
OLO1LOOL1O1OOI1I1 


By the sake of contradiction, assume that there is a way to place the given 
numbers as stated, and consider the graph G = (V; EF) such that V = 
{1,2,..., 13} and {x, y} € EF if and only if x and y are neighbors in the circle. 
Then, G has 13 vertices and 13 edges, and all vertices have degree 2. Now, look 
at the sets of vertices A = {1, 2,3, 4, 10, 11, 12, 13} and B = {5, 6, 7, 8, 9}; we 
can have at most two edges joining pairs of elements of A: one joining 1 and 4 
and another joining 10 and 13. Since we have 13 edges altogether, two of them 
departing from each vertex, we conclude that at least 4.2+4-1 = 12 edges 
join elements of A to elements of B. Since B has five elements, the pigeonhole 
principle guarantees that at least one of them must receive at least three edges, 
which is an absurd. 

Item (a) is immediate and (b) follows from the definitions of chain, antichain, 
vertex cover and matching. For item (c), note that (from i.) a minimal vertex 
cover corresponds to a maximal antichain, whereas (from ii.) a maximal match- 
ing corresponds to a maximum collection of disjoint chains (thus containing all 
vertices of degree 0) whose union is Vj U V2. 

For item (a), show that }°,, evd + (uw) counts each subset of two vertices exactly 
once. Item (b) follows directly from (a). For item (c), show that if (u,v) and 
(u, w) are edges of G, then there is exactly one transitive K3 in G involving u, 
v and w; then, conclude that the number of distinct transitive K3’s in which u 
enters with out degree 2 is equal to (7). For item (d), note that the number 
of K3’s in G is exactly (Cys then, use the results of items (c) and (a). For item 


566 


2. 


22 Hints and Solutions 


(e), substitute d+(u) = (d+(w) —d*) +d” in the formula of (d) and expand, 
using, whenever convenient, the result of (b). 

Looking at the described situation in the language of the previous problem, 
conclude that the number of sets one asks to maximize is at most 112. Then, 
write down an explicit table of results for the C) = 91 distinct matches, 
showing that it is indeed possible that one has 112 sets of three players 


satisfying the stated conditions. 


Section 5.2 


_ 1 2 k-1 k 
bij = PS Fit Ay by + + aye U1 ji? 
Ly .lk 


For item (a), if M = (a;;), use the definition of adjacency matrix to show that a 
generic summand a;,a;; of the sum defining entry (i, 7) of M ? is equal to 1 if 
and only if vz is adjacent to both v; and v;. For item (b), make induction on k. 
Finally, for item (c), use the result of (b) and adapt the argument given to item 


(a). 


. Show that H coincides with the maximal connected subgraph of G containing 


u. 


. For the first part assume, by contraposition, that G has at least two connected 


components, so that G is the union of two disjoint subgraphs H, and Hp, 
respectively with k and / vertices. Use the fact that k + / = n, together with 


discrete optimization (cf. Remark 4.32) to get C) + (5) < (5°): 


. Use the result of the first part of the previous problem. 
. Use the comments preceding the statement of the problem to prove that each 


vertex of G(m, n) belongs to the connected component of one of the vertices 1, 
Dg indy es 


. If G has two connected components H; and Hp, then each vertex of Hj is 


adjacent to all vertices of Hz in G, so that G contains the complete bipartite 
subgraph with independent sets of vertices V(H,) and V(A2). The rest is now 
clear. 


. Create a walk reduction algorithm that, as final result, gives a path between two 


chosen vertices, provided there is a walk from one to the other. Basically, from 
a given walk such an algorithm should eliminate each portion of it of the form 
(v = U0, Uj, ..-, Uk = V). 


. Execute the algorithm delineated in the proof of Theorem 5.24. 
. Adapt, to the present case, the proof of Theorem 5.24. 
. Let C; and C2 be two paths in G, of largest possible lengths. By the sake of 


contradiction, suppose that C; and Cz do not have common vertices. Take a 
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path C in G, joining a vertex of C, to a vertex of C2 and having the smallest 
possible length. Show that C has only one vertex in common with C; and only 
another one in common with C2. Now, concatenate a portion of C;, C and a 
portion of Cz to build a walk in G whose length is larger than that of C; or of 
Cp. 

There are Pa ways of choosing k — | vertices out of v1, ..., Un», and each of 
such choices generates (k — 1)! distinct paths with the stated properties. Now, 
use the additive principle. 

Adapt, to the present case, the proof of Theorem 5.24 in order to build a cycle 
in the given graph. 

Use the result of the previous problem. 

Let A, B, C, D and E be the mathematicians and Aj, Az, By, Bo, Cy, Co, 
D,, Dz and E}\, E2, respectively, be the time intervals during which they slept. 
Build a graph having these time intervals as vertices, with two such vertices 
being adjacent if and only if the corresponding time intervals overlap. Then, 
show that such a graph contains a cycle. 

Start by attributing the labels 1, 2, ..., k to a maximal walk (of length k) in G, 
that does not runs through the same edge twice. 

We first claim that G — € has at most two connected components. In order to 
prove this, let e¢ = {u,v}, and let w 4 u,v be another vertex of G. Since 
G is connected, we have in G a path C joining w and uw. Now, there are two 
possibilities: (i) C does not pass through e: then, we have in G —€ a path joining 
w and u; (ii) C passes through e: since the vertices of C are pairwise distinct and 
u is the last one of them, the next to last must necessarily be v; hence, stopping 
the path C in v, we obtain a path in G — € joining w to v. Anyhow, every vertex 
of G — € can be joined to u or v by a path in G — e€, so that G — € has at 
most two connected components: one containing u and another containing v. 
We now claim that G — € has at least two connected components. Indeed, if 
G — € was connected, it would contain a path joining u and v, and such a path 
would necessarily have length at least 2; adding edge € to such a path would 
give us a cycle in G, which is an absurd. 

Let u be the vertex of G with maximum outdegree. If the edge joining u to v is 
oriented from u to v, there is nothing left to do; otherwise, prove that d Tu) + 
d~(v) >n-—1, and use this to conclude the existence of w € Né (u) 1NG(v). 
For item (a), number the cities from | to n and orient the road joining i and j, 
with i < j, fromi to j. For item (b), suppose the wizard executed his intent 
somehow and note that, if one could always leave every city, then he/she would 
eventually return to a previously visited city, which is a contradiction. This way, 
conclude that there exists a city from which one cannot leave, label it as city 
n and exclude it from the subsequent analysis of the problem. Now, if it was 
always possible to leave each of the remaining n — | cities to another one of 
them, reach an analogous contradiction. Then, conclude that there is one of 
these remaining n — | cities such that one cannot leave it to another one of them 
(i.e., from there one can only reach city 1). Label it as city n — | and exclude 
it from the subsequent analysis of the problem. Repeat this reasoning as long 
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as there are cities to consider and conclude that, when just one city is left, then 
one can go from there to any other city. Finally, for item (c), use the proof of 
(b) to conclude that every possible way the wizard has to implement his rules 
is equal, up to isomorphism, to the one described in (a). Then, show that there 
are exactly n! distinct ways. 

Given an optimal path C = (co, c1,...,¢,2_1), we say that an edge of a unit 
square is traversed if it is a common edge of two consecutive unit squares cj 
and cj, of the path. Consider the graph G having as its vertices all of the 
(n + 1)? vertices of all of the n? unit squares of the chessboard, and as its edges 
the traversed edges of the unit squares. We classify the vertices of G in three 
distinct types: (I) the four vertices at the corners of the chessboard; (II) the 
remaining 4(n — 1) vertices at the edges of the chess; (III) the (n — 1)? vertices 
lying in the interior of the chessboard. Letting d, denote the degree of vertex v, 
we have d, < 3 for every v. Now, note that a path C contains an U if and only if 
some vertex has degree 3. By the sake of contradiction, assume that an optimal 
path C does not contain an U and conclude, with the aid of Euler’s theorem, that 
the number m of traversed edges is at most n* — 1. Then, show that m = n?— 1 
and conclude that the degree of each vertex of type (IJ) is 1, which is clearly 


impossible. 
Let uj, U2, ..., Un be the vertices of the graph and dj, do, ..., dy be their 
respective degrees. Also, let dj1, dj2, ..., dig, be the degrees of the vertices 


adjacent to u;. By the sake of contradiction, assume that 


dj, + dj2 +--+ + dia; = dj + dz +---+dn 
d; n , 


Then, 


n n n 
1 
5 (dij +.dj2 +---+dja,) < +(d4 Dt 
i= i= 


i=1 


_— Gita+---+dn)? 
- 


However, since each summand dj; at the left hand side is counted exactly dj; 
times (one time for each of the d;; neighbors of the vertex having degree dj;), 
it follows that 


Cie ey seernpe or eee 


dj +dy+---+d;< 


In turn, this contradict the inequality between the quadratic and arithmetic 
means (cf. problem 5.2.3 of [8], for instance). 
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Start by noticing that such a problem is a natural generalization of Problem 3. 
Then, adapt to the present setting the discussion of Example 2.29 and the hint 
given to Problem 20, page 32. 

Let G be a graph satisfying the stated conditions, fix a vertex u of G and count 
its neighbors and the neighbors of their neighbors to conclude that k > 5. If 
k = 5, conclude that such a counting computed exactly one vertex of G twice, 
and that such vertex is neither uw nor a neighbor of vu. Then, show that u belongs 
to a single 4-cycle in G and, by the arbitrariness of u, that G can be partitioned 
into 4-cycles, which is an absurd. If k = 6, construct a graph G of 25 vertices 
in the following way: 


e The set V of vertices of G is the disjoint union V = C= Aj, where each A; 
is a Set of five vertices and the subgraph G; of G induced by A; is a 5-cycle. 
° Forl <i < j < 5, establish a bijection ¢;; : A; — Aj such that, if 
x,y € Aj are adjacent in G;, then ¢;;(x) and ¢;;(y) are nonadjacent in G;. 
Then, impose that x and ¢;;(x) are adjacent in G, for all x, i and j as above. 


Finally, show that the graph G thus constructed satisfies the stated condition. 
Let us interpret the problem in the language of Graph Theory, in the following 
way: associate a vertex to each room and put an edge between two vertices 
if and only if there is a door between the corresponding rooms. For two rows 
of nm rooms each, the condition in the statement of the problem amounts to 
the connectedness of the resulting graph G, (of 2 vertices). Now, let a, be 
the number of connected graphs G,, satisfying the given conditions. Firstly, 
check that a} = | and az = 5. Then, use a recursive argument to show that 
An+1 = 4a, + bn, where by, is the number of graphs G, having exactly two 
connected components. Finally, show that b, = 1 + 2a; + 2a2 +--+-+ 2an—| 
and conclude, from this last recurrence, that dj42 = Sdyj+1 — 2d, for every 
n>. 


Section 5.3 


. For the first part, prove that (a) > (c) = (b) => (a). For the second, the 


established equivalences assure that the number of distinct paths in a tree with n 
vertices is equal to the number of distinct ways of choosing two of its vertices. 


. Ifw is a vertex of T with degree k, show that T — u has k connected components, 


all of which are themselves trees. Then, apply the result of Lemma 5.32. 


. For item (a), use the result of Proposition 5.33. For (b), show that if T U {e} 


did not contain cycles, then T would not be a spanning tree of G. For item (c), 
assume that G is not a tree and, hence, that it contains some cycle; show that G 
has at least two spanning trees. 


. Adapt, to the present case, the proof of the correction of Kruskal’s algorithm (cf. 


proof of Proposition 5.36), showing that Dijkstra’s algorithm builds a spanning 
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tree for the given graph and that the weight of such a tree is less than or equal to 
that of any other spanning tree. 


. Start by following Kruskal’s or Dijkstra’s algorithm to find one spanning tree 


with minimal weight. 


. Item (a) simply counts the number of odd integers in the list 1, 2, ..., CG). For 


item (b), start by painting a leaf of T, say in blue. Then, if some (but not all) 
vertices of T have already been painted, choose a vertex v of T which has not 
been painted yet and is adjacent to an already painted vertex u, and paint v with 
the color distinct from that of u. Continue this way until there are no unpainted 
vertices in 7. For (c), show that a path has odd weight if and only if its endpoints 
have distinct colors. Then, use the hint given to the second part of Problem 1. 
Finally, for (d), use the results of (a) and (c), together with the fact thatx+y =n. 


Section 5.4 


1S) 


. Given a vertex a of G and changing G by G, if necessary, we can assume that 


dg(a) > 3. Now, take vertices u, v and w in Ng(a). If two of these vertices, 
say u and v, are adjacent in G, then a, u and v are the vertices of a K3 in G. 
Otherwise, u, v and w are the vertices of a K3 in G. 


. Under the assumptions of (a), the pigeonhole principle guarantees that at least 


one of the remaining 2q — 2 vertices will receive at least (eee og +1=2 
edges from {u, v}; if w is such a vertex, then uuw is a K3 in G. If we are as in 
(b), then G — {u, v} has 2g — 2 = 2(q — 1) vertices and at least (q7 +1)- 
(2g —-2)+1)=(q- 1)?+1 edges. By induction hypothesis, G — {u, v} (and, 
hence, G) contains a K3. 


. Apply the result of the previous problem. 
. Adapt, to the present case, the solution to Problem 2. More precisely, start by 


taking (from the result of that problem) a K3 in G, with vertices a, b and c. 
Now, if one of the remaining seven vertices is adjacent to all of a, b and c, there 
is nothing to do. Otherwise, assume that k of the remaining seven vertices are 
adjacent to at least two of a, b and c, whereas the remaining 7 — k are adjacent 
to at most one of a, b and c. Then, we have at least k + 1 K3’s and at least 
26 — 3 — (2k + (7 —k)) = 16 — k edges in G — {a,b,c}. If k > 4, there is 
nothing to do. Otherwise, k < 3. Separately consider the cases k = 0, 1, 2 and 
3, arguing in the same way as above. 


. Let uj,..., uj_1 be the vertices of a K;_; in G. If uj, ..., uj_1 are all adjacent 


to at least one of the remaining n — (J — 1) = ( — 1)(g — 1) vertices of G, then 

we have a K; in G. Otherwise, at most (/—2)(n—(J—1)) = @-—Dd—-2)(q-1) 

edges depart from one of uw), ..., uj—; to the remaining vertices. Then, H := 
— {uj,...,uj—1} has (i — 1)(qg — 1) vertices and at least 
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Ma I —2)q? -@- DU -2)(q-) ce 20 I) —2)(q — 1)? 
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edges. By induction hypothesis, H contains a K;. 


. Construct a graph G having the teams as vertices and such that two of them are 


adjacent if they have played against each other. The stated conditions assure 
that a(G) < 2, and Proposition 5.46 that w(G) < 2. Conclude that G has at 
most 100 edges and, then, apply item (b) of Problem 9, page 132, to show that 
the tournament must have at least 90 matches. In order to build an example 
with exactly 90 matches, divide the teams in two groups of 10 and, within each 
group, let each two teams have a match. 


. Model the given situation by means of a graph, having the given people as 


vertices and with two people being adjacent if and only if they know each other. 
Then, use the given conditions to obtain upper bounds for a(G) and a(G). 


. Let two of the given points, say A and B, be joined by an edge if and only if 


LAOB > 120°. Elementary geometry guarantees that the resulting graph G 
does not contain a K3’s. Therefore, Turdn’s theorem assures that G has at most 
T (21; 3) = 110 edges. Now, note that the number of arcs we wish to count is 
equal to the number of edges of G, i.e. C) — 110 = 100. 


. Draw the line segment joining two points if and only if its length is greater than 


/2. If G is the resulting graph, use Ptolemy’s theorem on cyclic quadrilaterals 
(cf. Chapter 7 of [9]) to show that G does not contain a K4. Then, apply Turan’s 
theorem to get the desired upper estimate on the number of edges in G. 

Consider the graph G having the n people as vertices and such that two people 
are adjacent if and only if they are friends. Start by observing that G has q edges 
and, by the stated conditions, does not contain K3’s. Show that, for each vertex 
u of G, the number of amicable pairs that can be formed from the enemies of 
u is equal to g — )° cy) 4(v); from this, conclude that we wish to assure the 


2 
existence of a vertex u of G for which ae EN(u) = “o. To this end, note that 


2 


_ gf _ 4¢? 
~ de am= DF aorz—( do am] =—. 


ueV(G) vEeN(u) veV(G) veV(G) 


where the used inequality is that between the arithmetic and quadratic means 
(cf. (cf. problem 5.2.3 of [8], for instance). 

Let m = |,4,] and p = |< |, Also, let A be the set formed by the vertices 
u of G such that d(u) < p, and B = V(G) \ A, so that d(v) > p+ 1 for every 
v € B. Assume, by contradiction, that |A| < m and, hence, that |B| > n —m. 
Use Turan’s theorem to show that G)g contains a K; which is a contradiction. 
Let G be the graph having the 1001 people as vertices and such that two people 
are adjacent if and only if they know each other. Apply the result of the previous 
problem to G. 
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Let each element of S correspond to a vertex, and join vertices a and £ if and 
only if the sets A + a@ and A + 6 are disjoint (here, A+k := {x +k; x € 
A}). Thus, letting A = {a), a2,..., @101}, we conclude that an edge joins two 
elements a and 6 of S if |a — B| ¥ |a; —aj|, for all distinct 1 < i, j < 101. Use 
the fact that #{|a; —a;|; 1 <i < j < 101} < 101- mM to show that each vertex 
of the graph thus obtained has degree greater than or peal to 10° — 101 - 100, 
and note that the problem ends if we find a K 109 in our graph. 


Section 6.1 


10. 


11. 


. Ifnm = (agag_1 ...a1a9)10 is the decimal representation of the natural number 


n, thenn = yy aj 10/ . Now, use the result of Example 6.4, together with 
Corollary 6.9. 


. Write 10 = (11 — 1)* and expand the right hand side with the aid of the 


binomial formula. 


. Write n = ae aj 10/ and use the result of the previous problem, together 


with Corollary 6.9. 


. Ifn = (abc) 0 is the decimal representation of , use the result of the previous 


problem to conclude that a — b+ c = 0 or 11. Then, analyse these two cases 
separately. 


. If n has k algarisms, use the given conditions to show that 9 > 10?4-D — 


10* — 22 and, then, to conclude that k = 1 or 2. 


. Fora EN, start by observing that (a+ 1)(a+2)...(a+n) = fern) 
. Start by showing that a natural number is a multiple of 10 if and only if it is 


at the same time a multiple of 2 and 5. Thus, since the given number is clearly 
even, it suffices to show that it is a multiple of 5; to this end, use the result of 
Example 6.3. 


. Use the result of item (b) of Example 6.3. 
. In item (c), in order to show that S C nZ, use the division algorithm and the 


minimality of n. 

Start by analysing the first part of the problem. To this end, write 294 + 1 = 
(2 — 1) + 2 and factorise 2% — 1. 

Start by applying the existence part of the division algorithm to write m = 
aq + mo, with O < mo < a -— 1. Then, arguing by induction, write g = 
myak—! + my_jak~? +--+ +m, with 0 < mj; <a—1for1 < j <k. Now, if 


m= mya‘ +m,_,a*—! +. --+mja+mg = mia’ +m)_ja'-1+. . -+m',a+mo, 


with O < m;, m < a —1 for all i’s and j’s, use the uniqueness part of the 
division algorithm to conclude that mo and mo, being remainders upon division 
of m by a, are equal. Arguing once more by induction, apply the induction 
hypothesis to “—"° to conclude that k = / and m; = m’ forl <j <k. 
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13. For items from (a) to (e), it suffices to perform judicious applications of the 
results of items (a) and (b) of the previous problem. Regarding item (f), start by 
using the result of item (b). 

14. Start by analysing the case in which x belongs to an interval of the form (n, n+ 
5). for some n € Z. 


15. Start by showing that, under the given conditions, we have a + Ls +1> 


a+b _ | + ab and, then, that 1 + (a+b — 1)(a2 +b? — ab) > a*b?. Finally, 
conclude thata > b> b=1. 

16. Expanding the given equality we obtain (x + y)(x + z) = 2xz. Conclude that 
one of the sides of this last equality is a multiple of 4, while the other is not. 

17. Apply items (a) and (b) of Corollary 6.8. 

18. For item (b), use the result of (a); for (a), writen = 7g +r, withr = 
0, 1,2, 3,4, 5 or 6, and compute n>, 

19. We certainly haven > 1, andcan assume x < y < z;ifx = y, start by showing 
that n = 2; if x < y, write 1 = n?~*(n*-¥ — 1). 

20. For k > 5, show that k! end with 0; then, use the result of Example 6.10. 

21. Use the result of Example 6.10. 

22. For item (a), write n = 2'q, with r being a nonnegative integer and qg an odd 
natural number. Then, use the given condition to conclude that g = 1. 

23. By the sake of contradiction, assume this is not the case, and let N € N be such 
that 


n>N>s@"t!) > 5"). 
Problem | assures that s(3) is a multiple of 9 for k > 2, so that 
sB"t!) > 5B") 49, Wn >QN. 
Adding the above inequality forn = N,N +1,...,M — 1 to get s(3”) > 
s(3%) +9(M —N) = 9(M —c),for some integer c < M, which doesn’t depend 
on the chosen M > N. Now, note that 3” < 10”~ for all sufficiently large M 


(cf. Section 7.2 of [8], for instance); however, since the greatest possible value 
for s(m) when m < 10”~° is 


s(10“~* — 1) = s(99...9) = 9(M —c), 
M-c 


we have arrived at a contradiction. 


Section 6.2 


1. Apply Euclid’s algorithm to a = 21n + 4 and b = 14n +3. 
2. Firstly, write 2” = (n — 1)(n + 1); secondly, compute all possible values of 
gcd(n — 1,n + 1) and, then, use item (b) of Corollary 6.23. 
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. First show that, if there exists a natural number which is a term of both 


progressions, then there will exist infinitely many such natural numbers. 
Secondly, for the characterization of the existence of a natural number which is 
a term of both progressions, use Proposition 6.26. 


. Revisit the hint given to Problem 10, page 163. 
. Start by showing that the gcd we wish to compute divides (n+ 1)!—n! =n-n!. 
. Let d = gcd(a,b) and write a = du and b = dv, so that gcd(u, v) = 1. 


Conclude that ab | (a? + b?) if and only if uv | (u* + v). If this is so, since 
u,v | uv, we have that u, v | (u2 + v’) and, hence, u | v? and v | ue. Now, 
use the fact that gcd(u, v) = 1 to obtain u = v = 1. Alternatively, write 
a* +b? = abk, with k € N, and examine the discriminant of the second degree 
equation x? — (bk)x + b? = 0, which has a as a natural root. 


. Firstly, analyse the case gcd(a, b) = 1. In this case, let the rectangle has vertices 


(0,0), (a, 0), (a, b) and (0, b). Start by showing that if one of its diagonals 
has a point in common with an 1 x 1 square, then it crosses the interior of 
such a square. Subsequently, show that such a diagonal intersects the column 
of 1 x 1 squares bounded by the vertical lines x = j and x = j + 1 in exactly 
L2q7 + IJ + 1-2 fJ squares. 


. Let d be the desired gcd. It follows from the columns’ theorem of Pascal 


triangle (cf. Section 4.2 of [8]) that d divides the sum (a) of the given 
numbers. Therefore, d divides (* i ) _ ‘¢ a = G ae From this, and with 
the aid of Stifel’s relation (once more according to Section 4.2 of [8]), conclude 


that d divides (7 EP (a5); oe (ee), Finally, argue inductively. 


. Let d be the gcd we wish to compute. Then, d divides the sum of the given 


numbers, which (by the lines’ theorem of the Pascal triangle—cf. Section 4.2 
of [8]) is equal to 27”—!. Thus, d is a power of 2. Write n = 2*q, witha ¢ N 
odd, and show that 


2n _ k+l, (2n — 1)q 2n — 2 
2t—1 (2t — 1)(n—2t+1)\2r-2) 


From this computation, conclude that 2+! divides all of the given binomial 
numbers. 

If d = ged(n* +k, (n+1)*+h), then d | [((n +1)? +k) — (n? +k)]. Therefore, 
d|({Qn+ iy —4(n? +k)], and from this it is easy to conclude that d | (4k+ 1), 
and hence d < 4k + 1. Finally, exhibit a value of n for which the corresponding 
gcd is equal to 4k + 1. 

Start by writing a = du and b = dv, with d = gcd(a, b) and gcd(u, v) = 1; 
then, conclude that u, v | c and, hence, that uv | c. 

Let x* + y* — x = 2xym, with m € N. Then, by looking at such an equality as 
a second degree equation in x, show that the fact that x is an integer guarantees 
that the discriminant A = (2ym + 1)? —4y* = 2ym+1—2y)(2ym+1+2y) 
must be a perfect square. Finally, show that gcd(2ym+1—2y, 2ym+1+2y) = 
1 and use the result of Corollary 6.23. 
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Firstly, use the pigeonhole principle (cf. Sect. 4.1) to show that at least one of 
the five odd elements of our set is not a multiple of 3, 5 or 7. Now, let a be 
such an element, and b be any other element of the given set. Use the fact that 
gcd(a, b) | (a — b) to show that if gcd(a, b) > 1, then it is divisible by 2, 3, 5 
or 7. Finally, conclude ie a is the element we are looking for. 

Write aj + b = m| 9+ ae rj, with 0 < rj < m; then, use the condition 
gcd(a, m) = | to eondlude that ro, 71, ..-, “m—1 are pairwise distinct. 

Write r = du and n = dv, so that pedis v) = 1; then, let {“} — ut - [4 
and use the result of the previous problem. 

Use Bézout’s theorem to substitute gcd(m, n) = mx + ny, with x, y € Z, and 
develop the expression thus obtained. 

Start by proving that a,41 = a1 ...d, + 1 (cf. item (a) of problem 15 of [8]). 
Prove item (a) by induction on n and item (b) by induction on k. As for item 
(c), for subitem (i) use (b) to show, also by induction, that Fy, | Fg; for subitem 
(ii), use (i) and (a); for (iii), make induction on g > 0, using (b) and (ii) to 
perform the induction step. Finally, for item (d), adapt Euclid’s algorithm to the 
situation at hand, with the aid of the results of item (c). 

Adapt, to the present case, the several items and hints given to the previous 
problem. 

For item (a), let u and v be natural numbers such that au — bu = 1. The fact 
that n > ab gives us nau — nbv = n > ab and, hence, }* — “* > 1. Thus, 
there exists an integer ¢ such that “* <t < 4°. Letx =nu — bt, y= at —nv 
and obtain (b) arguing by eontradiotion: then, reduce (c) to (a) and (d) to (b). 
Concerning (e), let § = ab — a — b and show that, for 0 < m < S, exactly one 
of the numbers m and S — m can be written as prescribed. 

For (a), arguing by contradiction, let x, y, z € Z4 such that 2abc — ab — be — 
ca = xbc + yac + zab. Then, 


2abe = (x + Dbe+ (y+ Dact+ (z+ Dab, 


so that a | (x + 1)bc. Since gcd(a, bc) = 1, we conclude that a | (x + 1) and, 
hence, that x + 1 > a. Similarly, y+ 1 > band z+ 1 > c. But these give 


(x + lbe+ t+ Dac+ (2+ Dab>a-be+b-ac+c-ab = 3abc, 


which is a contradiction. For item (b), if n > 2abc — ab — bc — ca, then 
n > a-bc —a-— bc and the previous problem guarantees (since a and bc are 
relatively prime) the existence of integers x,t € Z+ such thatn = xbc + ta. 
Without loss of generality, we can assume that 0 < x < a; indeed, if x > a, 
write x = aq + x’, with O < x’ < a, thus obtaining n = x’be + (t + qbc)a. 
For (c), letting x < a — 1, we have 


ta =n —xbc > (2abc — ab — bc — ca) — (a — 1)bc = abc — ab — ac, 
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whence t > bc—b—c. However, since b and ¢ are relatively prime, by invoking 
once more the result of the previous problem we conclude that there exist y, z € 
Z4. such that t = bz + cy. Finally, for item (d), in the notations of the previous 
items we get 


n=xbce+ta=xbc+ (bz +cy)a = xbe + yac + zab. 


Adapt the items and hints given to the previous problem and make induction on 
n> 2. 

For the first part of item (a), use induction. For item (b), show that if 0 < a, < 
a2 < a3 <-:+ < d,_| are the remainders obtained in the execution of Euclid’s 
algorithm when a = dy41 and b = ay, thena; => Fj-; for 1 < j <n. Finally, 
show that (c) follows from (a) and (b). 

Let k be the only positive integer such that 2 < n < 2**!, and let M = 
Icm (1, 2,...,7). Show that 2* | M and 


with u being odd if and only if j = 2". 


Section 6.3 


. Firstly, p + q is even. Also, if p < q, then p < pea < q, so that (p + q)/2 is 


not prime. 


. Start by writing 7 = ye a2 be YE, 4. Then, eroup the 


; j=l fj. j=l 27, jak+1 7 ° : 
summands of this last sum in pairs to write it as a sum of fractions with all 
numerators equal to 3k + 2 = p. Finally, use the fact that p is prime. 


. We can assume that x # y. If x < y, prove that x < p < y and, hence, that 


gced(p,x) = 1. Write 2xy = p(x + y) to conclude that p | y. Let y = pz, 
with z € N, to get z = Ze , thus showing that (2x — p) | x and, hence, that 
(2x — p) | p. Finally, this gives 2x — p= 1. 


. Start by separately considering the cases n = 6k, 6k+1,...,6k+5.Forn = 6k, 


for instance, use the fact that k, 2k+ 1, 3k-+ 1 and 6k + 1 are pairwise relatively 
prime to conclude that k = 1. 


. Start by using the given condition to show that n | a,. Then, if p is a prime 


number which does not divide n, show that p { day. 


. For item (a), use the FTA (for whose proof we did not use the infiniteness of 


prime numbers. For the first part of (b) use, beside the result of item (a), the 
fundamental principle of counting. Finally, show that the result of (b) generates 
the inequality n < 2*,/n, which, in turn, generates a contradiction. 


22 


14. 


15. 


16. 


17. 


Hints and Solutions 577 


. In each case, adapt the idea of the proof of Example 6.40. To this end, observe 


that every prime p 3 is of the form 3k + | and every prime p ¥ 2, 3 is of the 
form 6k + 1. 


. Factorise 22" — 1 and use the result of Problem 4, page 177. 
. Use Lagrange’s identity—cf. Example 2.9—to write CG ie then, use the result 


of Example 6.42. 


. For item (a), show that if n is not a power of 2, then it is possible to factorise 


a” + 1. For item (b), argue similarly. 


. Consider the cases n even and n odd separately. In the case n = 4k, compute 


ged (4 — 1,n); in the case n = 4k + 2, compute gcd (5 — 2, n); finally, in the 


nt 
case of an odd n, compute gcd (n, ust), 


. Factorise cm+j; — cx and, then, apply the result of Problem 6, page 163. 


. Ifn = pj! ... py is the canonical decomposition of n, write u = pi ag re 
and v = pj’... pi‘, with Bj, y; = 0, and compute how many are the 


ordered pairs (6;, y;) for which max{f;, yj} = a;. Then, apply the result of 
Corollary 6.48. 

Write ad = oe with gcd(c, d) = 1, and deduce that b = x? anda = x‘, for 
some x € N. Then, conclude that x = 1 or cx24 = dx°. In the second case, 
analyse the subcases c < 2d,c = 2d and c > 2d. 

If n is odd, then there are no solutions, for the left and right hand sides of the 
given equality have distinct parities. If n is even, thenn = 5+ d5 + dz, with d3 
and d4 having distinct parities. If n is a multiple of 4, show that n = p? + 21, 
where p is the smallest odd prime that divides n; then, show that p | 21 and 
conclude that there are no solutions. If n is of the form 4k + 2, show that n = 
5(p” + 1), where, as above, p is the smallest odd prime divisor of n. Conclude 
that p = 5 and, then, that n = 130. 

Write n = d13q13 = d14q14 = d1s5q15, with q13, g14, gis € N. Condition (b) is, 


thus, equivalent to in + in + is = |. Therefore, start by getting all natural 
numbers a < b < c for which 1 + i + t = 1. 
Examine the cases 1 < n < 30 to conjecture that all naturals of the form 2k 2 


satisfy the stated property. Then, prove this fact by writing, for n = 2" — 2, 


k-1_y pi 
In) = D> d@j- + Yo (a@') +a@ +3) +--+ —2/)) 
j=! i=l 
» kel 
= (21-1) 4 (143454+---+@%%-1) 


2 Fe a4 Cane ai 
— (9k-1 = 
= (2 1) + 7 5 : 
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First note that 3 | n. Now, writen = SPF ge with3 < pj <---< py 
being primes and a > 0, 6 > 1 being integers. Then, note that the given relation 
reduces to 3(a + I)(6+ DK, +)---& +) = 2938 pf! opr, Conclude 
that 6 = 1 or 2 and n does not possess other prime factors apart from 2 or 3. 
Finally, analyse these two cases separately. 

For an arbitrary prime number p, it suffices to show that e,(2m) + e,(2n) = 
€p(m) +e,(n) +e,(m-+n). To this end, use Legendre’s formula, together with 
item (f) of Problem 13, page 164. 

For item (a), use Legendre’s formula to show that e.(2n) > 2e.(n) + | for 
every n € N. Alternatively, use Lagrange’s identity—cf. Example 2.9—and 
induction. For item (b), refine the computations of the first hint to (a) to show 
that e2(2n) > 2e2(n) + 2 if and only if n is not a power of 2. 

By contradiction, let d; < dy < ,/n be divisors of n, so that dj < dy < z < 
a It suffices to show that there exists no perfect square between d; and a3 
Letting d} = a,d. =a+u, Zz =a+vand a =a+u,withO<u<v<uw, 
we have a(a+w) =n = (a+u)(a+v). Ifk > Lis such that (k—1)? <a < k?, 
we claim that k7 < a+ w. Indeed, since /a < k < Ja +1, it suffices to 
show that (,/a + 1)? < a+ .w or which is the same, 2,/a < w — 1. From 
a(a+w) = (a+u)(a+ v) we obtainaw = au +av+uv, so that a | uv and 
a < uv. Itcomes thataw = au+av+uv > aut+tav+a, whence w > u+v+l. 
Then, w—1>u+v > 2,./uv > 2,/a. Finally, we cannot have equality in the 
last inequality, for otherwiseu =v > a+u=a+v>5da)= oe which is 
impossible. 

Writen =agt+ta,-2+a 27? 4 +++ + am2”, with a; € {0, 1} forO < j <m, 
so that ag + a) +--+ + 4m =k. Then, compute 


n a Be . m—j 
Ea =ajt+ajy1-2+---+am-2 : 
and, finally, use Legendre’s formula to show that e2(n) =n —k. 

Let p be prime and ¢ € N be such that (a + kb)(b + ka) = p’. Assuming, 
without loss of generality, thata < b, we havea+kb > b+ka > 1, so 
that there exist r,s € Z for which 1 <r<s,r+s =tanda+kb= p’, 


b+ka = p". Conclude that (b+ka) | (a+kb) and, writing HY? = k e =, 
that (b+ ka) | (k? — 1). Now, consider two separate cases: (i) if p > 2, use the 
fact that b+ka is a power of p and gcd(k—1,k+1) < 2 toconclude that b+ka 
divides k — 1 ork + 1, so thatb+ ka < k + 1 and, hence, b = a = 1. Then, 
show that the solutions are the 3-tuples (a,b, k) = (1,1, p’ — 1), for every 
integer r > | and every prime p > 2. (ii) If p = 2, start by observing that at 
least one of a and b is odd, so that k is also odd. Now, since (b+ ka) | (k* — 1) 
and b+ ka = 2", show that (b + ka) | 2(k + 1) or 2(k — 1); in particular, 
b+ka < 2k + 2 and, hence, a = 1 or 2. If a = 2, arrive at a contradiction; 
if a = 1, we have that (b + k) | 2(k + 1) or k — 1, which splits the analysis 
into that of two subcases: (b + k) | (2k + 2) or (b+ k) | (2k — 2). In the first 
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subcase, show that b = k+ 2 orb = 1. If b = 1, conclude that k = 2’ — 1; 
ifb = k +2, write s = 2u to get (a,b,k) = (1,2” + 1,2” — 1), for some 
u € N. In the second subcase, show that b = k — 2 and, then, let s = 2u to get 
(a,b, k) = (1, 2" — 1,2” + 1), for some u € N. 

Choose a prime p greater than | + 2 + --- + 1000 and show that the set A = 
{p,2p,3p,..., 1000p} satisfies the required conditions. 

For item (a), use the result of Problem 4, page 177, together with the fact that 
pq?) = q for every k > 0. For item (b), factorise q- — 1 and use the 
minimality of k to conclude that one cannot have pq? -—1l)> Che ). 
If t is the product of all primes less than or equal to m, let y) = x;,, where p;, is 
a prime greater than f. Letting p be the greatest prime divisor of yj andr € N 
be such that p = p;, define y2 = x;,, with t2 > r, t;, and convince yourself that 
gcd(y1, y2) = 1. Once you have chosen elements yj = x;, < --- < yj = Xt) 
of A satisfying item (b), with j < m, let p be the greatest prime factor of y; 
and r € N such that p = p;; define yj41 = Xtiays with ¢;+1 > r, tj. Continue 
in this way until 7 = m. Then, show that the set B thus obtained does satisfy 
condition (c). 


Section 7.1 


. If (x0, yo, Zo) is a solution, then (xga”*!, yoa”*!, zoa”) is also a solution, for 


any a € N. Now, get a solution by letting x = y=k+1. 


. Use Proposition 7.1 to write x + y, y + z and x + z in terms of the parameters 


u, v and d, as prescribed by that result. 


. Imitate the proof of Proposition 7.1. More precisely, if x, y,z > 0, start by 


observing that z — x and z + x are both even and, if d = gcd (5, sex), the 


equality 
= 2 
oa (ees (=) 
2d 2d d 


guarantees the existence of u, v € Z such that 5 = 2v? and oie = u*, Now, 
show that gcd(u, 2v) = 1. 


. In each case, use Fermat’s descent method. Specifically for item (d), start by 


showing that x and y leave equal remainders upon division by 3; then, by 
computing (3k + r)?, conclude that if x and y are not divisible by 3, then 
x3 + Sy? is not divisible by 9. 


. Start by writing x = u—v and y = u+v, withu, v € Q, to getx*+xy+y* = 


3u2 + v2. Then, letu = 7 and v = 2, with a, b,c € Zand gcd(b, c) = 1, to get 
Buz +02 = Saas Now, for item (a) use the result of Example 7.3; for item 


(b), adapt the proof of Proposition 7.1 to solve the equation 3a* + b? = c’. 
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. If z is odd, use the result of item (c) of Corollary 6.8 to show that there are no 


solutions. If z is even, use item (b) of that same result to conclude that w, x and 
y are also even; then, apply Fermat’s descent method. 


. Multiply both sides of the given equality by 4, complete squares and write 2x + 


3, 2y + 3 and 2z + 3 as fractions with a common denominator to reduce this 
problem to the previous one. 


. Start by multiplying out to show that it suffices to obtain rational solutions to 


the linear system of equations 


ax + 2cy + 2bz=1 
bx + ay + 2cz =0 
cx + by +az=0 


Then, use Cramer’s rule to reduce the problem to showing that the only solution 
of the diophantine equation a* + 2b? + 4c? = 6abc isa = b = c = O. Finally, 
for this last step, reduce to the case of a, b,c € Z and apply Fermat’s descent 
method. 


. Start by letting A; A2 be a diameter of the given circle. Then, use (7.1) to choose 


points A3,..., A, such that A;A; and A2Aj are rationals, for3 <i <n. 
Finally, apply Ptolemy’s theorem (cf. Theorem 4.18 of [9], for instance) to 
show that A; A; is also rational. 

The solution we present to this problem uses more the ideas presented along 
this section that any specific result we have derived. Anyhow, since it is much 
more difficult than the previous ones, we present a complete solution: start by 
simplifying the stated relation to get a? + c? — ac = b? +d? + bd or, which is 
the same, 


(2a — c)* + 3c* = (b + 2d)* + 3b”. (22.1) 


By contradiction, assume that ab + cd = p, with p prime. Condition a > b > 
c>d> Ogives p=ab+cd >4-3+42-1= 14, so that p > 17. On the 
other hand, 


2p = 2ab + 2cd = (2a —c)b+ (b+ 2d)c 


and, hence, gcd(2a — c, b+ 2d) divides 2p. For gcd(2a — c, b+ 2d) to be even, 
we should have b and c both even, and this would give p = ab + cd also even, 
which is impossible. Therefore, 


gced(2a —c,b+2d)=1 or p. 


We claim that gcd(2a — c,b + 2d) = 1. On the contrary, suppose that 
gcd(2a — c, b + 2d) = p. Then, (22.1) would give us p? | 3(b” — c*); in turn, 
since p # 3, this would assure that p? | (b? — c?). However, p = ab+ cd > b 
forces 0 < b* — c? < p*, which is an absurd. 
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Now, let x = 2a — c and y = b + 2d. Then, gcd(x, y) = 1, and it follows 
m (22.1) that x? — y* = 3(b? — c?) or, which is the same, 


(x—y)aty)=36-—c)b+c). 


We consider two cases separately: 


(i) 


(ii) 


b and c have distinct parities: the fact that p = ab+cd gives gcd(b, c) = 1. 
Since b + c and b — c are both odd, this implies gcd(b + c, b—c) = 1. An 
analogous reasoning also gives gcd(x + y,x — y) = 1. 
If3 | («+ y), then 

x—y = ged(x — y, (b+ .c)(b—c)) = ged(x — y, b +c) ged(x — y, b—c) 
and 
x+y = 3gcd(x+y, (b+c)(b—c)) = 3 gcd(x+y, b+c) ged(x+y, b—-c). 
Writing a = gcd(x—y, b—c), B = gcd(x—y, b+c), y = gced(x+y, b—c) 
and 6 = gcd(x + y,b+c), we getx — y = af andx + y = 3y6. On the 
other hand, 

b—c=gced(b—c,x — y)gcd(b—c,x + y) =ay 
and, analogously, b + c = B6. Solving for a, b, c and d, we obtain 

4a =aP + 656+ 3yd—-—ay, 2b=ay + Bo, 

2c =—ay + Bd and 4d = —aB — 65+ 3y6 —-ay. 

Thus, 
8p = 8(ab + cd) = BS(2a7 + 387). 
However, since b + c is odd, we conclude that both 6 and 6 are odd and, 
hence, B5(2a? + 357) is odd too, which is impossible. 
If 3 | (« — y), we reach a contradiction in a likewise manner. 

b and c have equal parities: in this case, b and c must both be odd, for, 
otherwise, 2 would divide ab + cd = p. Thus, in the above notations, x 
and y must also be odd. It follows that 


gcd(b+c,b—c) = gced(x+ y,x —y) =2, 


for we already have gcd(x, y) = 1. 
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If 3 | (x + y) (the other case is, once more, entirely analogous), then 
writing 
x—y\/(x+y = b—c\ (b+c 
2 a 2 2 
and letting 


oo Be ig 
w= god(*5*. ). p=sca(* 2 


b— b 
y = 20 (*F*, ‘). = goa (*F", **) 


we atrive, as in the previous case, at 2a = a8 + Bd+ 3yvéd—ay,b = 
ay + B6,c = —ay + Bd and 2d = —aB — 65+ 3y6 — ay. Hence, 


2p = 2(ab +. cd) = BS(2a* + 387). 
Neither 6 nor 6 are equal to p, for, otherwise, we would have b > p, which 


contradicts the fact that ab + cd = p. Therefore, 66 < 2, and this gives 
0<d<c<2-ay < 1, whichis an absurd. 


ction 7.2 


. Substitute d = 4k + 3, m = 41 + 3 and use Corollary 6.8. 


2. Just note that x,41 + Yn iv2 = (x1 + yiv2) (Xn + ¥nn/ 2): 


. Imitate the discussion of Example 7.6, observing that x = y = | is a solution of 


the stated equation. 


. If x? — dy? = mand a? — db” = 1, witha, b EN, then 


(a — bVd)(xo + yoWd) - (a + bV'd) (x0 + yoV'd) = m 
or, which is the same, 
[(axo + bdyo) — (ayo + bx0)Vd|[(axo + bdyo) + (ayo + bxo)Vd] = m. 


Therefore, x1 = axo + bdyo and yj = ayo + bxo also solve the given equation. 
Now, observe that 


xf + dy? — (a? + db*)(xé + dyo) + 4abxoyod > ke + dye, 


so that (x1, y1) 4 (x0, yo). 
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Multiply the equality n? + (n + 1)? = m? by 2, complete squares and apply the 
result of the previous problem. 
By completing squares, conclude that the given equation is equivalent to (2x + 
y)* — 5y* = 4. Now, use the result of Problem 4, noticing that x = y = lisa 
solution of the stated equation. 


. By adapting the hint given to Problem 4, show that if u,v € N are such that 


u? — Av? = 1, thena := uxp— Avyg and 6 := uyg — vxo are an integer solution 
of x? — Ay* = 4an. Now, note that 


ax” + bxy + cy? =n => (2ax + by)? — Ay? = 4an, 


so that, in order to generate an integer solution of equation ax* + bxy+cy? =n, 
it suffices to show that it is possible to solve, in Z, the linear system of equations 


eae 
y=6 , 


In turn, such a task is equivalent to showing that 2a | (a — dB), which can be 
done by writing 


a — dB = (ux — Avyo) — b(uyo — vx0) 
= u(xo — byo) + bu(xo — byo) + 4acvyo, 


for the last expression is a sum of multiples of 2a. 


Chapter 8 


2. 


3. 


4. 


Prove first that [9 <aj, ¢ = [[o<ajn 7- Then, let P denote this product and use 
such an equality to show that P* = n@™), 

For 0 < d | n, we have d + ri > 2,/n. Now, add all such inequalities over all 
positive divisors of n. 


For the second equation, show that an ordered pair (x, y) is a solution if and 


only if x = n—- so that n + y must be a divisor of n? greater than n. 
Now, use the result of Example 6.47 and (8.3) to conclude that there are exactly 
5(d (n2) — 1) distinct possibilities for y and, hence, for (x, y). In what concerns 
the first equation, adapt the hint given to the second one to conclude that there 
are d(n2) distinct solutions. Finally, conclude that we ought to have 3d (n?) = 


157, which is an absurd. 


. For item (a), use the result of Lemma 8.3. For item (b), if gcd(m, n) = 1, use 


the result of item (a) to compute |D (mn)| — |D3(mn)| in terms of |D,(m)| — 
|D3(m)| and |D1(7)| — |D3(1)|; then, apply an inductive argument. 
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. For item (a), multiply both sides of the desired equality by n. For item (b), make 


a direct computation for the sum of the positive divisors of 2?~!(2? — 1). 


. The second part of item (b) uses item (b) of Problem 10, page 190. 
. Show that s(ab) > s(a)b, for all a,b € N. To this end, exhibit, in terms of 


the positive divisors of a, a set of positive divisors of ab whose sum is equal to 
s(a)b. 


. Apply the result of Proposition 8.4. 
. Apply the result of Proposition 8.4. 
. Use Proposition 8.4 and the result of Problem | to conclude that both sides of 


the desired equality can be seen as arithmetic multiplicative functions, say F 
and G. Then, if p is prime and a € N, show that F(p*) = G(p%). 


. Apply double counting—cf. Sect. 2.2—first showing that, for 1 < j < n, the 


left hand side counts f(j) exactly l= J times. 


. Start by applying the result of the previous problem; then, apply Proposition 8.4 


to show that F(m) = 1 if m is a perfect square and F(m) = 0 otherwise. 


. If f(¥) = 1 for every 7 € N and F(n) = Vo<aln f(d), then F(n) = d(n) for 


every n € N. Now, apply the result of Problem 12. 


. Use the M6bius inversion formula together with the result of Problem 1. 
. Use the result of Proposition 8.4. 
. Start by proving that, for a fixed positive divisor d of m, there are exactly 7 - 


¢(d) ordered pairs (d, n) as stated. Then, use the result of Corollary 8.13. 


. For item (a), use item (a) of Problem 19. For (b), write S,,(m) = yy (n—a;)” 


and expand the binomial (n — a;)”. For the first part of (c), use (b); as for 
the second part, separately consider the cases n even and n odd and apply the 
conclusion of item (a) in the case of an even n. 

For item (a), choose d in such a way that gcd(m,n) = 5 and a such that 


n 


m = 7-4; then, conclude that gcd(a, d) = 1. For item (b), use the result of (a) 


to show that )° < ain (a) Sx(d) = 1% + 2k +... +n*. Item (c) follows from 
Mobius inversion formula, applied to the result of (b). Finally, for items (d) and 
(e), use the formula of (c). 


Section 9.1 


1S’) 


. Use the Prime Number Theorem. 
. Use the divergence of the harmonic series (cf. Section 7.4 of [8], for instance), 


in conjunction with Lemma 3.15. 


. Use again the divergence of the harmonic series, together with Theorem 9.5. 
. For the first part of item (a) use the fact that, if p is prime andn < p < 2n, then 


gcd(n!, p) = 1; for the second part, use the binomial formula. For item (b), use 
(a), together with the fact that the number of primes p satisfying n < p < 2n is 
exactly 2(2n) — m(n). For the second part of item (c), use the fact that 5 > x2/3 
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andx +2 < ae for x > 8. For item (d), make a direct checking of the cases 
2 <n < 8; then, prove the remaining cases by induction on n > 8. Finally, for 
item (e), use items (c) and (d), together with the (obvious!) inequality z(x) < 
2154 +2. 
5. Note that 
T(n) = (101) +73) +--+ 7M) + (72) +74 +---4+7 2Ln/2])) 


=(14+34+---+i(n)) + (t(0)) + 7(2) +--+ +7 ([n/2])) 


= plilm) + 1P $F Ln/2)). 


This gives (a), and iterating such a recurrence relation, we get (b). For (c), we 
have 


t 


ie as | ay 1 
1m) <5) (4+1) =i (Peter tt) 


1 1 
sa ere ea | 


Now, since t = [logy n| <n, we obtain (d) by estimating 


one BOT sind ci ne zs 
n = = nN 
~4\ 3 3. Allogan}] 9 |logz 2} 


A 


ll 
Alo 
s™~ 
w| 5 
~~ 

mn 

| 
4| 


Ze HE hg eo ee ae + |logyn|+1 
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The estimates leading to (e) are quite analogous and will be left to the reader. 
The first part of (f) now follows immediately from (d) and (e), together with 
T(n) = os Hence, 


12 1 15 3 
eo IO tet 
6+ ~“Ta)~ 6+8 


I 2 
for every n > 1, from where we get lim (n) = ~. Thus, ifr # 5, then there 
n>+o0T(n) 3 


exists ng € N such that 


I 2 
n> mo >| (n) | 


T(n) 3 


In particular, n > no > mn # 3. 
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Section 9.2 


1. From Chebyschev’s theorem, there is at least one prime between p, and 2py. 

2. Letting p be the largest prime which does not exceed n, apply Chebyschev’s 
theorem to show that 2p > n. 

3. Ifn = 2k, write 1!2!...n! = 2*k1(3!5!..., (2k —1)!)* and, then, apply the result 
of the previous problem. The case n = 2k + 1 can be dealt with in an analogous 
way. 

4. Let p and q denote the largest primes that do not exceed m and n, respectively. 
Use the result of Problem 2 to compare the greatest powers of p and q in both 
sides of the given equation. 

5. Start by using the result of Problem 2 to show that n = p, a prime number; to this 
end, follow a path analogous to the hint given to the previous problem. Then, if g 
is the greatest integer which is less than or equal to p — 1, conclude that m > 2q. 
Finally, use the remark that precedes the statement of this problem to conclude 
that g < 5 and, hence, that p — 1 < 6. To solutions are m = n = 2 orm = 10, 
n=7. 

6. Let n > 1 be a natural number satisfying the stated conditions, and p < /n 
be a prime number. Then p | n, for otherwise we would have p* < n and 
gcd(n, p*) = 1. Thus, n is divisible by pi, ..., pe, where pj < --- < px are 
those prime numbers less than ./n. Now, let / € N be such that 2! < /n < 2'+!, 
Since (invoking again the stronger version of Chebychev’s theorem, alluded to 
above) for every integer k > 1 we have at least two integers between 2* and 
Qk+1 we obtain 


Pie. Pe > 2(22...2/-1)? = 2”, 
This way, 


2 
Pg < pi... pPexn< gare. 


so that / < 2. Hence, there are three cases to consider: 


(i) 7 = O: in this case, we have 1 < n < 4, and it is immediate to check that 
n = 2 or 3 are the possible solutions. 
(ii) / = 1: we have 4 < n < 16 and 2 | n, so thatn € {4, 6, 8, 10, 12, 14}. A 
simple inspection show that the possible solutions are n = 4, 6, 8 or 12. 
(iii) 1 = 2: in this last case we have that 16 < n < 64 and 2,3 | n. Therefore, 
n € {18, 24, 30, 36, 42, 48, 54, 60} and, once more by inspection, we obtain 
the solutions: n = 18, 24 or 30. 
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Section 10.1 


10. 


11. 


12. 


13. 


. Make induction on m > 2. 
. Use the elementary properties of congruences, together with the fact that 10 = 


—1 (mod 11). 


. Use congruences to compute the remainder of the given number upon division 


by 4. Then, apply the result of Proposition 10.9. 


. Fork € Z4, show that 7* = 1 (mod 10), 7+! = 7(mod 10), 744+? = 


—1 (mod 10) and qicts = 3 (mod 10). Then, compute the remainder of the 
division of 3!° by 4. 


. If all prime divisors of n were congruent to 1, modulo 4, use the elementary 


properties of congruences to conclude that we were to have n = | (mod 4). 


. The first equality gives us 2°? + 278 . 5+ = 0 (mod 641); the second, 278 . 5+ = 


(—1)* (mod 641). 


. Assume that the decimal representation of n has k + 1 digits, and let m be the 


natural number formed by the first & digits of n. Show that the stated condition 
is equivalent to 6 - 10 + m = 4(10m + 6) or, also, to 3 - 10 = 13m + 12. 
Then, conclude that it suffices to find all natural numbers k for which 10k = 
4(mod 13). 


. Start by observing that 2” + 3” = 0 (mod 7) if and only if 6” + 9” = 0 (mod7), 


which, in turn, is equivalent to (—1)” + 2” = 0(mod7). Then, compute the 
possible congruences of 2”, modulo 7. 

Apply congruence modulo 3 to show that either p or g must be a multiple of 3, 
hence equal to 3. Then, assuming g = 3, write p* + 9p + 9 = n? and solve 
such an equality for p. 

For item (a), start by reducing the terms of the Fibonacci sequence modulo 
2 and modulo 4, and note that the two reduced sequences thus obtained are 
periodic. Items (b), (c), (d) and (e) can be dealt with analogously. 

Start by using the result of item (c) of Problem 2, page 71, to conclude that if 
d = gcd(F,, L,), then d | 4. Now, distinguish the cases 3 { n and 3 | n, and 
apply the results of the previous problem. 

For items (a) and (b), use induction on n € N. For item (c), start by using the 
result of item (a) to write 2Fin+2% = Fin Log + FoxLm. Then use the result of 
the item (b) of Problem 2, page 71, as well as (a) again, to get 


2Fn42k = Fin(Ly + 2(-D‘!) + FeLeLm = 2(-1)*! Fin (mod Ly). 


Now, apply the result of item (b) of Problem 11, taking into account that 2 | k 
and 3 { k. For item (d), argue in an analogous way, writing 


2hing2k = SF in Foe +Lm Log = 5 Fin Fy, Lytle (L2-+2(—-1)*"! = —2L (mod Lx) 


14. 


and, hence, Lm+2% = —Lm (mod Lx). 
Apply item (d) of Proposition 10.9. 


588 


15. 


16. 


17. 


18. 


19. 


20. 


21. 


22. 


23. 


25: 
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Look at the equation modulo 3 to conclude that m is even. Then, factorize k* — 
2” and argue as in Example 10.11. 

Ifn > 5, show that m? = 3 (mod 10) and, hence, that p > 2. Then, use modulo 
3 to conclude that 3 | m, so that 27 | m?. Finally, look at the equation modulo 
27 to conclude that there is no solution when n > 9. 

Use modulo 3 to show that z is even, say z = 2t. Then, factor 52! — AY and 
argue as in Example 10.11. 

If y = 1, then p = 2andx = 1. If y > 1, write p* = (y+ DO? —y+1) 
and show that, letting d = gcd(y + 1, y* — y + 1), one has d = 1 or 3. Finally, 
consider separately the cases d = 1 andd = 3. 

Write a = 2a and b = 2! B, with k,/ € N and a, 8 being odd naturals. From 
this, show that kb = la and a? + Bl = 2°-k>_ Then, use modulo 4 to conclude 
thata = 6B = 1 and ia = ve Finally, analysing the function f(x) = 3r, show 
that, ifk,/ > 4, then k =I. 

Firstly, show that it suffices to consider the case b = a”. Then, in order 
to analyse it, make induction on n > O, using congruence modulo a at the 
induction step. 

Start by looking at the given equation modulo 3 to show that b is even. Then, 
let b = 2c and conclude that 2°+! = 15 + 1 or 2°+! = 344.5%, Letd =c+1 
and, in the first case, use modulo 3 to conclude that d is even and, hence, that 
24/2 _ | = 34 and 24/2 + 1 = 5“. In the second case, use modulo 4 to show 
that a is odd, so that, if c — 2 > 0, we have 


909 a tt at 5 ee 5 1 S md 2), 


Thus, conclude that c — 2 = 0. 

Using the fact that 1992 = 24 - 83, show that the division of 10!9°* by 10°? +7 
leaves remainder 7*4. Then, if g € N is such that 10!9°? = (108 + 7)q + 7°4, 
use congruence modulo 10 to compute the last digit of qg. 

For item (a), if x is even, say x = 2z, we get (7* — 2)(7* + 2) = 3”. Conclude 
that both 7* — 2 and 7* + 2 must be powers of 3, and use the fact that (7* + 
2) — (7 — 2) = 4 to get a contradiction. For item (b), assuming that if y > 1, 
show that 7* = 3” + 4 = 4(mod9), and that this gives x = 2 (mod 3). Hence, 
deduce that x = 2 (mod 3) and x = | (mod 2) implies x = 5 (mod 6). Item (c) 
now follows from letting x = 6g + 5 and computing 


Te = 7945 = (7®)1. P = (-1)4 - (—2) = £2 (mod 13). 


Finally, for (d) and (e), compute 3” = 1, 3 or 9 (mod 13) and, hence, 3” + 4 = 
5, 7 or 0 (mod 13); then, show that this contradicts (c). 
Looking at the equation modulo x + 1, we obtain (—1)*+! = 1(modx + 1). 
Therefore, from x > 1 show that z is odd. Now, write 


(x $1) = x2 41 = (64+ DO! — 3x27 4... -— x +1), 
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to get (x + 1)97! = xe-! — x2? +... — x + 1. If x is odd, show that both 

sides of this last equation have different parities, which cannot occur. Now, use 
the fact that x is odd, together with 

a: x+1)?-1 - 

ie OFF aety Peet @t)D41, 


to show that y is also even. Then, letting x = 2s and y = 2r, we have 


[@+p'-1][@+D' +1] = @s)%. (222) 


Note now that (x + 1)‘ — 1 and (x + 1)‘ + 1 are consecutive even numbers, so 
that they have gcd equal to 2. Use modulo x to get (x + 1)’ — 1 = 0(mod x) 
and (x + 1)‘ +1 = 2 (mod x), so that the first factor is a multiple of x, whereas 
the second one is of the form xg + 2, with 2 being its only common factor with 
x. Use this to conclude from (22.2) that 


(x + 1) — 1 =2s? 
(x+1)'+1=2%-!° 


Then, finish the solution. 


Section 10.2 


1. Apply Fermat’s little theorem with p = 7. 
2. Start by considering the cases p = 2,3,5. If p € 2,3,5, write the given 
difference as 


(as 10°? = 10°) 2G 10? = 104 9 = 1, (22.3) 
where a = 11...1. Then, apply Fermat’s little theorem to conclude that a = 
— = 


1 (mod p) and _ analyse the remainder of the division by p of each summand 
of (22.3). 

3. Show that, after performing k > 1 operations, the j-th term of the sequence 
goes to position 2* j (mod 2n + 1). Then, use Fermat’s little theorem. 

4. Firstly, use modulo 11 to conclude that there are no integer solutions such 
that x = O(mod 11). Then, if x #4 O(mod11), use Fermat’s little theorem 
to conclude that 11 divides x> + 1 or x° — 1 and, hence, that y? = —5or 
—3 (mod 11). Finally, show that such congruences do not have any solutions. 

5. We may assume p > 2. In this case, ifn = pq +r, Fermat’s little theorem 
assures that it suffices to find an integer 0 < r < p such that 277” = r (mod p) 
for infinitely many naturals q. For what is left to do, choose r = 1 and, then, 
infinitely many q’s. 
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6. 


10. 


11. 


12. 


13. 


. For item (b), start by using the fact that gcd(p,q — 1) = 1 to write px 
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For the sake of contradiction, assume that there exists an integer > 1 such that 
n | (3" — 2”). If p is the smallest prime factor of n, with n = mp, use Fermat’s 
little theorem to conclude that 3” = 2” (mod p). Then, use the minimality of 
p to show that gcd(m, p — 1) = 1. Finally, write mx = (p — l)y + 1, with 
x,y € N, and use Fermat’s little theorem once more to reach a contradiction. 


(q — 1)y + 1, with x, y € N. Then, apply Fermat’s little theorem. 


. If k stands for the number of digits of 5”, show that the problem is equivalent 


to finding m > n for which 5” — 5” = 0 (mod 10‘). Prove that n > k and 
conclude that it suffices for us to have 5’”"~” 1 (mod 2‘). Then, use Euler’s 
theorem. 


. With the aid of Euler’s theorem, prove first that if / is odd, then ged(2?.0™ — 


3,1) = 1, for every m € N. Then, choose ky, ..., ky inductively. 

Use MoObius inversion formula to get a, = Xo <d\n w(d)22, Now, use Euler’s 
theorem to show that, if p is prime and p® is the greatest power of p that divides 
n, then p® | ay. To this end, write n = p%k, with gcd(k, p) = 1, and, from this, 


of p&k Dek 
an = > pda = \* w(dy2*a + S* w(pd)2 7 


O<d|p%k O<d|k O<d|k 
pok pele 
= Do wT — YP nar 
O<d|k O<d|k 
pel P pel 
- Ew (0-2 
O<d|k 


yo Ls 
> nara Qa — i) 
k 


O<d 


Dy mcayar a (205 — 1), 


O<d|k 


For item (a), argue by contradiction. More precisely, ifa = (p1—1)...(pe—1), 
with k > 1 minimum as in (a), use the fact that ged(2¢/? — 1, 24/2 + 1) = 1 to 
conclude that there exist / < k primes among p1,..., Dx, SAY P1,---, Pi, Such 
that 27/2 —1 = pi... p/". Then, letting b = (p; — 1)... (py — 1), use the fact 
that (2? — 1) | (24/2 — 1) to contradict the minimality of k. 

For item (b), use Fermat’s little theorem. For (f), if g { x, show that x9-D/?2 = 
+1 (mod q), and analogously for y¢@~!/? and z4~-)/; then, use the equality 
xQ-D?2 4 ya-D/? 4 74G-D/? — 0, together with q > 5, to get a contradiction. 
Finally, for (g) ii., observe that if g | a and gcd(a, d) = 1, theng { d. 

For item (e), you will need to use the result of Example 8.14. For item (h), first 
show that gn = 2”~'g,+(2"~! — 1), for every integer n > 1; then, use Fermat’s 
little theorem to show that, if gi is an odd prime, then it is possible to choose 
n > 2 such that qq | gn. 
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Section 10.3 


1. We prove both items simultaneously. If x is an integer such that ax = b (modn), 
then we have ax = nq +b for some q € Z. Therefore, b = xa + (—q)n, a linear 
combination of a and n, so that Bézout’s theorem furnishes gcd(a, n) | b. Now, 
let gcd(a, n) = d, with d | b. Then, 


b(modn) ~ & a d=) 
=> mo —x =>— {moda—}]. 
ax n a a a 


However, since gcd (S, 7) = |, the existence of solution follows from Corol- 


lary 10.25. For what is left to do, start by noticing that 
n n 
ax = axg (modn) & x = xo (mod =) &x=xot qa dt eZ. 
On the other hand, it is immediate that 


xo + at =xot+ oh (modn) © t, = fo (modd), 
so that there are as many pairwise incongruent solutions for the original equation 
as there are integer values for ¢ pairwise incongruent modulo d. Thus, the original 
equation has exactly d = gcd(a, n) pairwise incongruent solutions modulo n. 
2. Assuming that the given congruence has any solution at all, use Bézout’s 
theorem, together with item (d) of Proposition 10.6, to show that the gcd between 
gcd(a\, a2,..., ax) and n divides b. Then, observe that 


gcd(gcd(a, a2,..., ax), n) = gcd(a1, a2,..., ax, n). (22.4) 


For the converse, apply Bézout’s theorem once more, together with (22.4). 

3. Letting x denote the total of soldiers, show that it satisfies a system of two linear 
congruences with relatively prime moduli. Then, use the proof of the Chinese 
remainder theorem to show that 


x =7-12-12+5-1-13(mod12- 13) 


and, hence, that x = 132q + 17 for some g ¢€ Z. Finally, use the fact that 
600 < x < 700. 

4. For item (a), start by observing that, if uv, v is a solution of the linear diophantine 
equation and x = m,u + a, = mzv + ay, then x solves (10.7) for k = 2. 

5. Apply Euler’s theorem. 

6. If p and gq are distinct primes, Example 6.29 assures that 2? — 1 and 27 — | are 
relatively prime. In view of this fact, show that an integer x satisfies the stated 
conditions if and only if it solves a certain system of linear congruences. Then, 
apply the Chinese remainder theorem. 
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7. Adapt, to the present case, the idea of the proof of Example 10.28. 
8. For implication <— of item (a), let aj € Z be an integer solution of f(x) = 
0 (mod Pp," ). If x is any solution of the system of linear congruences 


x = aj (mod p}') 
x = az (mod p5”) 


x = ag (mod p;") 


whose existence is guaranteed by the Chinese remainder theorem, use item (c) 
of Proposition refprop:propriedades da congruencia to conclude that f(x) = 
0 (mod P;') for 1 < i < k and, hence, that f(x) = O(modm). For item 
(b), let S(t) be a set of N(t) pairwise incongruent solutions, modulo ft, for the 
congruence f(x) = O0(modtr). Use the result of (a) to infer the existence of a 
bijection f : S(m) > S(p{'') x --- x S(py*); then, apply the fundamental 
principle of counting. 

9. If a is the initial term and r the common ration of the progression, the given 
hypotheses are equivalent to the existence of x, y € N such that x” = a (modr) 
and y> = a(modr). By the same token, the thesis is equivalent to the existence 
of z € N such that z° = a (modr). Make complete induction on r, the case 
r = 1 being trivial. For the induction step, separately consider the cases r = p%, 
for some prime p and some a € N, andr = st, with s,t > 1 relatively prime. 
In the first case, if x and y are as above and y’ is the inverse of y modulo p®, 
show that z = xy’ solves the problem. In the second case, let u and v be terms of 
the arithmetic progressions (a + ks)x>0 and (a + kt)x>0, respectively, such that 
ui =a (mods) and v’=a (mod t); use the Chinese remainder theorem to find 

z € N such that z° = a (modst). 


Section 11.1 


1. Note that aq, +a. +--- +a, =1+2+---+n(modn). 

2. Adapt the proof of item (b) of Proposition 11.3 to the present case. 

3. Adapt the discussion of Example 11.4 to this case, this time using the result of 
the previous problem. 


Section 11.2 


2. This amounts to noticing thata-(6+¢) =a-b+c=a-(b+c)=a-b+a-c= 
a-b+a-C€=a4-b4+4-C. 
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3. Ifa = Cand b = d, thena = c(modn) and b = d (modn), so that a= b= 
c — d(modn). But this is the same as saying that a = c and b = d imply 
a —b=c —d, whichis exactly what we wanted to verify. 


4. The units of Z2 are 1, 5, 7 and TI (note that (12) = 4). Since l =5° =7 = 
Ai = 1, there is nothing left to do. 

5. Adapt, to the present case, the discussion of the well definiteness of the 
operations of addition and multiplication in Z,. 

6. For the first part, ifn > 1 is not prime, write n = ab, with 1 < a, b <n. For the 
second, note that if @- b = 0 in Zp, then p | ab and, hence, p | a or p | b. 

7. For the first part, look at the particular case @- x = 1. For the second, multiply 
both sides of @- x = b by a! and use the associativity of multiplication. 


Section 12.1 


2. By contradiction, if 23" = —1 (mod 17), then 273" = 1 (mod 17). Now, use the 
congruence 2!© = 1 (mod 17) to conclude that ord,7(2) = 1 or 2, which is a 
contradiction. 

3. If m = ord, (a), then m | 2k and, by Euler’s theorem, m | y(n). Hence, m | 
gced(2k, p(n)) = 2d, where d = gcd(k, g(n)/2) (recall thatn > 2 > g(n) 
even—cf. Problem 19, page 220). If m | d, then m | k, which contradicts the 
given congruence. Therefore, m = 2d’ for some divisor d’ of d, and hence m is 
even. 

4. If n | (2” — 1), then n is odd. If n > 1 and k = ord,(2), then 1 < k | n, y(n). 
Now choose, from the very beginning, the least possible natural number n > 1 
such that n | (2” — 1). Arrive at a contradiction by showing that k | (2k —1). 

5. Let S denote the given product. Legendre’s formula (6.10) assures that the 
greatest power of 2 that divides n! has exponent 


+00 


a= [5] <y an 


j=l j=l 


therefore, e2(n) < n — 1 and, hence, 2°) | S. Now, let 2 < p <n be prime 
and / be the exponent of the greatest power of p that divides S. Use the fact that 
2” — 2' = 0 (mod p’) if and only if ord ni (2) | (n — t), together with the idea of 
the proof of Legendre’s formula, to conclude that 


n n 
~Elatm|-ELsl- 


j2l jzl 


6. Show that kj; = 2k; (mod 2n + 1). Then, for item (a) conclude that f(n) = 0 
if and only if the set {1, 2, pian 3. .} contains a RRS modulo 2n + 1. For item 
(b), show that 
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f (1997) = 2- 1997 — ord. 1997-41 (2) 


and compute ord?.19974.1(2) = 8-23 = 184 by using the equalities 2.19974 1 = 
5-17-47, ordj7(2) = 8 and ord47(2) = 23. 


. Assume that we have found gq as in (a), so that p | (q — 1). Arguing by 


contradiction, suppose further that there exists n € N such that n? = p (modq). 
Since p # q we have gcd(n,q) = 1, and Fermat’s little theorem gives 
q-l 


l=78 "=o 7 (mod q), which is mnpossthle For (b), note thata = p?~!+ 
pe-* 4.--4+ p+1= p41 (mod p’), so that a has at least one prime factor q 
for which g # 1 (mod p”). In turn, if k = ord, (p), then the fact that a | (p? — 1) 
gives k | p;ifk = 1, we would have 0 = a = p?~'+ p?-?4+.--+p+1 (modq), 
so that p = q, acontradiction. 


Section 12.2 
1. Compute a | (mod 29) and, then, conclude that ord29(2) = 28. 
2. Separately consider each one of the cases n = 2, 4, p* and 2p*, with p being 


an odd prime and k € N. 


. Letting a be a primitive root modulo n, note that the a;’s are congruent modulo 
n, is some order, to a, a”, ..., a”, so that 
vny(g(n)-+1) 
aja2...dg(n) =a 2 (mod n). 
: : ; e@)oa)+) 
Now, if m = 2 or 4, give a direct argument to show that a 2 = 
—1(modn); if n = p* or 2p*, for some odd prime p and some k € N, 
(n) 
use the fact that ord,(a@) = g(n) to conclude that a? =-1 (nod n)—be 
(n) 
careful: n | (aos +1)(a 1) does not necessarily imply n | (a” - + 1) or 
(n) 
altaz =I), 


. Firstly, adapt the proof of Lemma 12.12 to the present case, showing (by 


induction) that for every integer n > 3 there exists an odd integer g such that 
ioe 8 + 1. Back to the posed problem, start by taking kj = 1, and 
ky = 2. Then if 19% — 97 = 2”q and q is even, take kn41 = kn; if g is odd, 
take kn+1 = ky +2"-?. 


. Ultimately, the solution of this problem also relies in a variation of the idea of 


the proof of Lemma 12.12. Indeed, one wants to find infinitely many m € N 
such that m? +7 = O(mod 2‘). Start by showing that it suffices to establish the 
existence, for every / € N, of m; € N such that me+7 = O(mod 2! ); then, prove 
this last statement by induction on /. For the induction step, if me +7=2'n), 
sn 1 > 3 and n; odd, try mj4; = am; and show that it is enough to have 
a? = 2'q + 1, with g odd; then, prove that a = 52° works. 


22 
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. If a?4-! = 1 (mod3 pq), then a3?4—! = 1 (mod p); in particular, letting a be 


a primitive root modulo p, we must have (p — 1) | (3pq — 1) and, analogously, 
(¢q — 1) | Gpq — 1). From this, conclude that (p — 1) | (3g — 1) and (¢g — 1) | 
(3p — 1), from where p = 11 and g = 17 or vice-versa. Finally, show that 
a> !!17-! = | (mod3- 11-17) for every natural number a relatively prime with 
3-11-17. To show that p = 11 and gq = 17 if p < q, one can argue in the 
following manner: since p < q, we have 


3p—-1 _3q-1 2 2 
E < Z =34+—— 33+, 
q-1 q-1 q-1 4 
so that et : = 1, 2 or 3; analysing each one of these cases separately, we 


3q-1 _ 9q—3 
p-l ~ 2q-4’ 


than 4 and less than 8; thus, a = 5, 6 or 7, and these possibilities lead to 
q = 17 as the only actual one. 


Seite: fet 3p — 1 = 2(q — 1) and, hence, a number greater 


. For item (d), take a separate look at each one of the cases ord, (x) = 1, 2, pet 


and p — 1—here we used the fact that pot is also prime. 


. For (a) = (b), writen = p,... px, with pj < --- < px being primes such 


that (pi — 1) | (v — 1); then, apply Fermat’s little theorem to conclude that 
a” = a(mod pj), for every 1 < i < k. For (b) => (a), take a prime divisor p 
of n and let a be a primitive root modulo p* to obtain a contradiction if p? | n; 


then, let a be a primitive root modulo p, to show that (p — 1) | (n — 1). 


qpia-l) —q'-! 
Sn = 1+4: { ——=——] (mod p)). 
a -—1 


For item (a), observe that 


pq-1p-l pq—-1p-1 
> jit" = r= pg: ae 1 = 0(mod p). 
j=0 1=0 j=0 1=0 


For item (b), use (a) to conclude that, if n = p*q, with p prime and q natural, 
then s, = 1 (mod p). Item i. goes through with a computation analogous to that 
of the hint given to item (a). Item ii. follows immediately from i., together with 
the fact that, modulo p;, we have {a, a’, Sic aPi-'} = {1,2,..., pj — 1}. Fpr 
iii., if n | s,, then s,. = 0 (mod p;). However, since p; | (a?i"~) — a”—!), the 
formula in ii. guarantees that the only way of having s, = 0 (mod p;) is to have 
Di | a —1). Then, p; —1 = ordp; (a) | (1-1) and, hence, (p; —1) | (qi—1) 
(since n — 1 = (p; — Dqi + (qi — 1)). For item iv., ifn | sy, then p; | s,, and 
it follows from i. and iii. (more precisely from (p; — 1) | (n — 1)) that 
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pi-1 pi-1 


0=s5,=l+q So =14+q >> 1=14+4i(p — 1 (mod pj). 
l=1 l=1 


Hence, g;(pi — 1) = —1 = p; — 1 (mod pj), so that g; = 1 (mod p;). Finally 
in what concerns item v., it is enough to show that if pi(p; — 1) | (q@ — D 
for 1 <i < t, then p; | s, for 1 <i < f; to this end, it suffices to use 
the congruence of item i., together with the fact (easily deducible from our 
hypotheses) that (pj — 1) | (nm — 1). 

For item (a), if ged(a* +1, 2) > 1 thenn | (a*+1), and hence a2* = 1 (modn); 
this contradicts the fact that ord,(a) = 2pk. For i., it follows from aP = 
—1 (mod n) that a2kP = 1 (mod n), and thus d | 2kp = n—1. On the other hand, 
if d | 2k, then a2* = 1 (modn) and, this way, n | (aX — 1)(a* + 1); however, 
since gcd(a* +1,n) = 1, we getn | (ak — 1) and then a*? = 1 (modn), an 
absurd. Now, if d | 2kp and d { 2k, it is immediate to see that gcd(d, p) > 1, 
and hence that p | d; therefore, the last part of i. follows from Euler’s theorem. 
For ii., it is pretty clear that p + (2kp + 1); therefore, the formula for g(2kp + 1) 
assures that p | (¢ — 1), for some prime divisor g of 2kp + 1; thus, it suffices to 
let q — 1 = Ip, noticing that / > 1, for otherwise both p and gq = p+ 1 would 
be odd primes, which is impossible. Finally, for iii., if 2kp + 1 = (Ip + l)u 
then u = | (mod p), so thatu = hp +1 for someh > 1. If h > 2, then 
n= (Ip+ (hp +1) > 2p +1) > 22p4+ 1)p+1 > 2kp+1 =n, which 
is impossible. Therefore, h = 0 or 1, andif h = 1 then (p+ 1) | Qkp+ 1), an 
absurd (for p + | is even and 2kp + | is odd. 

Show? first (by induction) that for every k € N there exists a natural number 
mg of k digits, all of which equal to | or 2, such that ok | mx. Now, note that 
5 { qdiooo, and use the fact that 2 is a primitive root modulo 51000 to findk € N 
with 2* = q1000 (mod 51009), Finally, writing 2 = gg 51 qio00, for some 
a €N, show that 2‘+!9 has the desired properties. 

Item (a) follows from Fermat’s little theorem, together with the fact that ooh + 
xj + x} + xf + xf = 0(mod p) if and only if f(x1,...,x5) = 1 (mod p). 
Item (b) follows from the multinomial formula desenvolvimento multinomial 
(cf. Problem 2, page 28) and from the result of (a). 


Section 12.3 


1. 
2. 


Use Euler’s criterion. 
Adapt the proof of Example 12.24 to the present case to obtain infinitely many 
integer values of k having the desired property. 


>This is based on the solution given by professor Carlos Gustavo Tamm de A. Moreira. 
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. Start by multiplying equality ax? + bxy + cy? = n by 4a and completing 


squares. 


. First show that if such x and y did exist, then they would be relatively 


prime with 122. Then, use the result of the previous problem to conclude 
that 17 should be a quadratic residue modulo 61. Finally, apply the quadratic 
reciprocity law and Proposition 12.23 to reach a contradiction. 


. For item (a), if every prime divisor of 2b* + 3 was congruent to +1 modulo 8, 


we would have 2b? + 3 = +1 (mod 8); show that this is impossible. For (b), if 
(2b? + 3) | (a* — 2) and p is as in (a), then a* = 2 (mod p), so that 2 would 
be a quadratic residue modulo p; use the result of Example 12.27 to reach a 
contradiction. 


. Let L, = 2x. If n is odd, use item (b) of Problem 11, page 252, together 


with the fact that L, is even, to show that n = 3 or 9 (mod 12). Then, apply 
several times item (e) of this same problem to get L, = L3 or Lo = 4 (mod 8), 
and show that this gives x* = 2 (mod 4), which is impossible. Now, assuming 
that n is even, consider the following three separate cases: n = 0 (mod 4), 
n = 6(mod8) orn = 2(mod8). Ifn = 0 (mod 4), writen = 2-3!-k, withk € Z 
such that 2 | k, 3 { k. Argue as in the proof of Example 12.25 to obtain L, = 
—Lp = —2 (mod L,) and, hence, x2 = —1 (mod L;). Show that this furnishes 
a contradiction, exactly as the one obtained in the proof of Example 12.25. If 
n = 6(mod 8) but n = 6 (note that Lg = 18 = 2 - 3), writen =6+2-3!-k, 
with k € Z such that 4 | k and 3 { k. Use item (d) of Problem 13, page 253, to 
obtain L, = —L6 = —18 (mod Lx), and hence x? = —9 (mod L,). Also as in 
the proof of Example 12.25, conclude that one can take a prime divisor p of Lx 
such that p = 3 (mod 4), so that x? = —9 (mod p). Then, show that 3 { Lx, so 
that ged(p, 9) = 1 and, thus, (>) = (>)(>") = (>') = —1. Finally, in the 


P/\ p P 
case n = 2 (mod 8), just note that L_, = L, and —n = 6 (mod 8). 


. Let F, = x2. Ifn = 1(mod4) but n # 1, writen = 142-9) .k, with 


k € Z such that 2 | k, 3 { k. Use item (c) of Problem 13, page 253, to 


get F, = —F, = —1 (mod Lx); then, arrive at a contradiction, exactly as in 
the proof of Example 12.25. If n = 3 (mod4), use the fact that F_, = F, 
and —n = 1|(mod4). If n is even, write n = 2m and apply item (a) of 


Problem 13, page 253 to get F, = FL. Then, distinguish two distinct 
subcases: if gcd(n, 3) = 1, use Problem 12, page 252 to conclude that L,, is 
also a perfect square; then, apply the result of Example 12.25. If gcd(n, 3) = 3, 
use Problem 12, page 252, again to conclude that L,, is twice a perfect square; 
then, apply the result of the previous problem. 


. Apply Gauss’ lemma, as was done in Example 12.27, or the quadratic reci- 


procity law. 


. For item (a), if every prime divisor of 2” — 1 was congruent to +1 modulo 12, 


we would have 2” — 1 = +1 (mod 12); show that this is impossible. For (b), if 
(2” — 1) ¢{ (3 — 1) and p is as in (a), then 3” = 1 (mod p) or, which is the 
same, 3’"+! = 3 (mod Pp); since m + | is even, it would follow that 3 would be 
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a quadratic residue modulo p; now, use the result of the previous problem to 
get a contradiction. 

For item (a), use the quadratic reciprocity law. For item (b), use the result of 
Corollary 12.20, together with the fact that p — 1 is a power of 2. 

For item (a) make induction on k > 3, showing that if He = a (mod Qk ), then 
am =a (mod 2‘*!), with Xk+1 = XK OL XR + 2k-! For (b) make an analogous 
induction; more precisely, if xf = p‘q +a, let xp41 = xR + p*t, with t € Z, 
and impose the validity of the congruence oe 4, = 4 (mod p**!) to deduce that 


t must satisfy the linear congruence 2x,t = —g (mod p); finally, show that it is 
always possible to choose such a ¢. 
If x2 = a(modn) for some integer x, then x2 = a(mod2*) and x2 = 


a (mod pe ), for 1 < i < f; it then suffices to apply the results of the 
previous problem, together with Euler’s criterion. Conversely, if conditions (i) 
and (ii) are satisfied, apply once more the results of the previous problem and 
Euler’s criterion to assure the existence of integers xo, x1, ..., xX; such that 
Xa =a (mod 2*) and a = a (mod pi ), for 1 <7 < ft; then, invoke the chinese 
remainder theorem to get, out of the x;’s, an integer x such that x=a (mod n). 
Number the children in the counterclockwise sense and in a continuous way, 
so that the child at positions 1,n + 1, 2n + 1, 3n + 1, ...is the same one, and 
analogously for the remaining children. Show that the teacher will give candies 
to the children situated at positions i. for k € N, so that it suffices to 


find all natural values of n for which the congruence xO) =a(modn) hasa 
solution for every integer | < a < n. Now, notice that if such a congruence has 
a solution, then the same holds with the congruence (2x + 1)? = 8a+1 (modn). 
Finally, use the results of Problem 12 to show that n must be a power of 2. 

For =>, use Fermat’s little theorem. For <, let a be a primitive root modulo 


p, and j € N be such that 57 = —1 (mod pot); then, take k € N such that 
ak = al (mod p) and show that k | d and x9 = ak/4 solves the congruence 


x” =a (mod p). 

Items (a) to (d) only require simple algebraic manipulations and elementary 
properties of congruences. For the first part of item (e), use Wilson’s theorem, 
together with Euler’s criterion; for the second part assume, by the sake of 
contradiction, that r; = rj; (mod p) and use the fact that p t ( ti — i*). For 
the second part of (f), conclude that 


where, in the last congruence, we used the fact that p > 5. 
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1. Start by showing, with the aid of Proposition 10.9, that if such integers k, 1, 
x, y and z did exist, with / > 1, then x, y and z would all be even. Then, 
use this to reduce the analysis to the case / = 0, and use again the results of 
Proposition 10.9. 

2. By contradiction, suppose that a+b is even. Then, a+c? = a*—b* = 0 (mod 4), 
and hence a = —c* (mod 4). From this, conclude that a(a— 1) has a prime factor 
congruent to 3 modulo 4 and which appears in its canonical decomposition with 
odd exponent. 

3. Observe that 1995 = 3-5-7. 19, with 3,7,19 = 3(mod4). On the other 
hand, if d is a positive divisor of 1995 (the case d < 0 can be handled in a 
totally analogous way), analyse separately the following four possibilities: d = 1, 
d=5,1<d|(3-7-19) andd = 5d’, with 1 < d' < 3-7- 19. For instance: (i) 
if d = 5, then x7 + y* = 5(x — y) if and only if (2x — 5)? + (2y +5)* = 50, so 
that 2x —5 = +1, +7; (ii) if 1 < d | (3-7-19), then, since x + y* = 0 (mod 4d), 
the argument sketched in item (ii) of the proof of Theorem 12.33 guarantees that 
d|xandd | y; setting x = da and y = db, we obtain a* + b* = a — b, which 
is equivalent to the first of the four possibilities above. 

4. Example 6.40 guarantees the existence of infinitely many primes of the form 4k+ 


3; then, choose distinct primes q1, ..., Gn, all of which congruent to 3 modulo 4, 
and apply the chinese remainder theorem, together with Theorem 12.33, to the 
system of linear congruences x = —i + q; (mod q?). forl <i<n. 


5. For item (a), note that |S| = a For (b), use (a). Item (c) follows from (b), 
together with the fact that y? + ys + y3 + yz = mp (mod p). Finally, for (d), 
start by using a descent argument to show that p can be written as a sum of four 
squares; then, apply several times identity (12.10) to show that any n € N can be 
written as a sum of four squares. 

6. For item (a), assume that n has an odd divisor d > 1. Then, since (24 — 1) | 
(2” — 1), we have that (24 — 1) | (m* +9), which is a sum of two perfect squares. 
However, since 24 — 1 is of the form 4k — 1, he must have some prime divisor p 
which is also of this form. It now follows from a slight modification of the proof 
of Theorem 12.31 that p | m and p | 3, and hence p = 3. Therefore, 3 | (27 — 1), 
which is an absurd, for 24 — 1 = (—1)¢ — 1 = —2 (mod 3) (recall that d is odd). 
For item (b), start by observing that the case k = | is trivial. Now, assume that 
(22" — 1) | (m2_, + 9) and notice that (22° ' + 1) | (3-22)? + 9). Then, 
with the aid of the chinese remainder theorem, choose m; € N such that 


my = mp_1 (mod 22’ — 1) and my =3-22 (mod2"' +1). 


sat : : k-1 k= bas : 
It is immediate to verify that 2 —~land2” +1 divide me +9; however, since 
such numbers are relatively prime, we conclude that their product also divides 
me + 9. 
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2. For item (b), write |z + w|* = (z+ w)(z + w) and use the result of item (b) of 
Lemma 13.2, together with (13.8). 

3. For item (a), apply inequality |z + w| < |z| + |w|, with w — z in place of z 
and z — v in place of w. For item (b), observe that the desired inequality is 


equivalent to —|z — w| < |z| — |w| < |z — w|; then, in order to show that 
|z| —|w| < |z—w, apply inequality |z+ w| < |z|+|w|, writing z— w in place 
of z. 


4. If z can be written as in the statement, use the second part of item (a) of 
Problem 2 to conclude that |z| = 1. Conversely, suppose that |z| = 1. Impose 
that z = ie. with w € C, to obtain w = i i: then, use items (b) and (d) of 
Lemma 13.2, together with the fact that |z| = 1, to conclude that w € R. 

5. Expand both sides of the inequality |z — a|? < |1 — @z|?, with the aid of the 
result of item (b) of Problem 2. 


6. First note that 
ze S |[z—al* = R? © |z|" — 2Re(G@z) + |a|* = 


1 1 2 
Then, set w = ~ to successively obtain F = a| = R*, hence |a|?|a~!—w|* = 
R?|w/?, then |w|?(\a|?— R?) —2Re(aw)+1 = 0. Finally, write this last equality 


as |w|2 2Re( 


aw 1 —_ 
ial2—R2 + az—R? = 0 and complete squares. 


7. First note that zp41 = 7x (1 + si). Then, compute |zx| in terms of k by 
using telescoping products (cf. [8], for instance). Finally, note that zx41 — zx = 

8. Review the definitions of addition and subtraction of vectors, in Chapter 8 of 
[9]. 

9. Assume such a total ordering does exist and arrive at a contradiction. You may 
wish to use that i = —1. 

10. Checking items (a) to (e) is somewhat tedious, though elementary. In what 
concerns (f), the first part is immediate from the definition of multiplication 
in H; for the second part, it suffices to use the identity |a8|* = aBaB, together 
with the result of item (e): Item (g) follows from the second part of (f). Finally, 
if a € H \ {0}, then a - “, = 1; therefore, if a8 = 0 anda ¥ 0, then 


a a 
wade esl ictenes 


On the other hand, if a6 = 1, then af — oe = 0 or, which is the same, 


a (B _ =) = 0; however, since a # 0, it follows that B = a (here, we 
wrote a — B to denote a + (—f6), where —8 = (—w)+ (—x)i+(-—y)j + (—z)k 


ifB =w+xi+yjt+zk). 
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In the complex plane, let a, b, c, d and e be the complex numbers associated 
to the vertices of P and assume, without any loss of generality, that Jig = 
|b — c|. Use a geometric argumento to show that we can suppose d = 0 and 
e = 1. Then, observe that the stated hypotheses are equivalent to |a|*, |b|?, |c|?, 
la — 1|?, |b —1]?, |e — 1], |a — b/? and |a — c|? being all rationals. Conclude, 
with the aid of the result of Problem 2, that Re(a), Re(b), Re(c), Re(ab) and 
Re(a@) are rationals and, hence, that Im(a)* = |a|? — Re(a)? is rational. Since 


Re(ab) = Re(a)Re(b) — Im(a)Im(b) 
= Re(a)Re(b) + Im(a)Im(d), 


deduce that Im(a)Im(b) is rational; analogously, show that Im(a)Im(c) is also 
rational. Now, since (once more from Problem 2) |b — c|? = |b|? + |e|? — 
2Re(bc), it suffices to show that Re(bc) is rational. To this end, note that 


Re(bc) = Re(b)Re(c) + Im(b)Im(c), 
and then it is enough to show that Im(b)Im(c) is rational. For what is left to do, 
write 
Im(a)Im(b) - Im(a)Im(c) 


Im(b)Im(c) = ince)? 


Section 13.2 


1S) 


. Adapt, to the present case, the proof of Corollary 13.8. 
. Start by observing that @ + L = —1, w** = 1, o**! = w and w+? = w for 


every k € Z. 


. Put 1 + 3: in polar form and use the first de Moivre’s formula. 
. Take conjugates in the second equation to obtain Zz; + Z2 + z3 = 0. Then, use 


the first equation and item (d) of Lemma 13.2 to conclude that 2 + . + - =0 
and, hence, that z}z2 + z1z3 + z2z3 = O. Finally, observe that 0 = z)(z1z2 + 
2123 + 2223) = zi (zo +73)+1= -2 + 1, with identical relations being true 
for z2 and z3. 


. Adapting to the present case the solution given to Example 13.17, write z—1 = 


1+Ro 
1—Ro’* 


oe 2R (2-*) = 2R ( wo—R ) 
"Re RS Rey R= 1 Soe) 


Rw(z+ 1), where w is an n-th root of unity, so that z = 


Now, compute 


and take modulus on both sides. 


. Write a, = (n—o)(n—o”), with @ = cis %. Then, recall that 1+@+o* = 0, 


from which we obtain (k — 1 — w)(k — w”) = (k +. w”)(k — @”) = k* — w and, 
analogously, (k — 1 — w*)(k — w) = (k+ @)(k —@) =k? — a. 


. Letting w? = p* — 4q?, use Bhaskara’s formula to show that the roots of 


the given equation have the same modulus if and only if Re(pw) = 0. Then, 
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substitute p = rcisa, q = scisB and w = tcisy into w* = p? — 4q’, with 
r, 8,t € R;, and conclude that |a — 6| must be an integer multiple of z. 


. Write z2 = 25 + 21 and z3 = 24 + z; to reduce to the case z; = 0; then, write 


25 = wz, and z3 = WZ» for an appropriate complex number w of modulus 1, 
to reduce to the case z, = 1. 


. Setting w = cis am use the first formula of de Moivre to compute the real and 


imaginary parts of the left hand side of (13.20). 


. For item (a), adapt the hint given to the previous problem. Do the same with 


item (b); you may find it helpful to use that sin? x = 5(1 —cos(2x)). 
Perform twice the substitution z + wz +a in the stated equality. 
Let O be the center and R the radius of the circle circumscribed to the 


polygon. By looking to the plane as the complex plane, we can assume that 


O corresponds to 0, A; to R, A2 to Ra, with w = cis 2 and P to z. Hence, 


2n? 
Ag = Rok! for 1 < k < 2n, and 
——2 _ 
Az P = |Rw* hg 


= R? + |z|? — 2Re(Rw*'z). 
Analogously, Aga = R* + |z|? — 2Re(Ro ae Z)s so that 
n 9 n > 
Y> AxP = > Ax-1P 
k=1 k=1 
if and only if 


n 
- Re(R wz => Re(R wz 
k=1 
In turn, this is the same as asking that 
n n 
Re( RE) orl) = Re( RZ > om). 
k=1 k=1 


Finally, show that )“t_, @—-! = yvt_, w@**-? = 0. 


Section 13.3 


Noe 


. Fora givenn € N, U,(C) is a finite subset of C satisfying both conditions. 
. Start by using the finiteness of A to show that all of its elements are roots of 


unity. Then, if z € A and zk = 1 for some natural k, show that k | n. 


. You may use item (d) of Problem 21, page 220, together with the fact that g(n) 


is even for n > 2 (cf. Problem 19, page 220). 
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4. For item (a), since d divides both m and n, substituting each piece 1 x m by 7 
pieces | xd and each piece | xn by 7 pieces 1 xd, we conclude that the rectangle 
can be assembled by using only pieces | x d. The rest follows from Klarner’s 
theorem. For item (b), let g be a natural number such that g > mn. According 
to item (a) of Problem 20, page 179, gq can be written as g = mx + ny, with 
x,y € N. Therefore, if p,q € N are such that g > mn, then we can assemble a 
Pp X q rectangle by forming p rows | x q, with each row composed by x pieces 
1 x m and y pieces | x n (x and y as above). However, if m,n { p,q, then 
Klarner’s theorem assures that one cannot assemble a p x q rectangle by using 
only pieces 1 x m or only pieces | x n. 

5. Arguing as in the proof of Klarner’s theorem, write x! y/ inside the 1 x 1 square 
of the chessboard situated at row i and column j, for 1 < i,j < 8. After we 
cover the board, assume that the 1 x | square is the one situated at row k and 
column /. Use double counting to get 


I+2+xE(x, y+ 0+ y+ y)F Oy) +2*y' 
H (etx +e 4x8) (y ty? +---+y%), 


for some polynomial expressions E and F in x and y. Let w = cis at Substitute 
x = y =o to get w**! = | and, hence k +1 = 0 (mod 3). Substitute x = w”, 
y = w to get w+! = | and, hence 2k + / = 0 (mod 3). Finally, conclude that 
k,l = 0(mod3), and show that any one of the choices (k,/) = (3,3), (3, 6), 
(6, 3) or (6, 6) do correspond to actual possible positions of the 1 x 1 square 
(i.e., show that the remaining 63 squares of the chessboard can be filled with the 
21 given rectangles | x 3. 

6. Adapt, to the present case, the proof of Klarner’s theorem. 

7. Once more, adapt the proof of Klarner’s theorem to the present case. 

8. Use the result of the previous problem. 


Section 14.1 


2. The results of items (a) and (b) are respectively X” — a” and X” + a”. As for 
item (c) recall that, according to Example 4.12 of [8], every natural number can 
be uniquely written, up to reordering, as a sum of powers of 2 with nonnegative, 
pairwise distinct integer exponents. 

3. For (X + 1)”(X + 1)”, note that each product that contributes to the coefficient 
of X? comes from a product of the form aX* .bX?-*; now, note that a = (7) 
and b = Fe a): 

4. Start by establishing the commutativity and distributivity. For the associativity 
of multiplication, start by using the distributivity to show that, if f(ghi) = 
(fg)hi and f(gh2) = (fg)hz, then f(g(r1 + ha)) = (fg)(hi + hz). Then, 
iterate such a reasoning to show that it suffices to establish the (obvious) 
equality aX” (bX" -cX?) = (aX™ - bX") - cX?, for all a,b,c € K and 
m,n, p € Zy. 
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5. 


oo 


10. 


By contraposition, show that if f, g € K[X] \ {0}, then fg + 0. To this end, 
write f(X) = So, a;X' and g(X) = )%_) bj X/, with ax, by A 0, and 
examine the coefficient of X**! in fg. 


. Letting g(X) = f(x) + 7 we are left to finding g, of degree 2001, such that 


g(X)+g(1—X) = 0or, which is the same, g(X) = —g(1—X). Setting g(X) = 
(X —a)?l, we must have (X — a)?! = —(1— X — a)?! = (X+a—1)71. 


. Apply the multisection formula. 
. Adapt the proof of Theorem 14.10 to the present case. 
. Apply the result of the previous problem to 


fH\=K+D"=>) (‘x 


k=0 k 


n=4,r=landz= V2. 

Let ao be the constant term of f and write f(X”) = g(X)A(X), with 
g,h € R[X] \ R being such that their nonvanishing coefficients are positive. 
Let g(X) = ) jo b)X! and h(X) = Yri_y cj X/. Since ap # 0 and ag = boco, 
we have bo, co # 0. Therefore, bp, co > 0, and hence ap > 0. Suppose, for the 
sake of contradiction, that g has a term of the form br X‘, with b; > O and m { t. 
Then, the coefficient of X' in f(X”) is 


> bic; — > bicj + bcg > 0. 


i+j=t it+jst 
i,j=0 i>0,j7>0 


Now, reach a contradiction and finish the proof. 


Section 14.2 


. Compute (X? + X + 1)” by writing X7 + X + 1 = (X* — X +1) + 2X. Then, 


use the uniqueness part of the division algorithm. 


. Write X" +1 = X" —14+2 = (X")"/" — | +2. Then, use the result of item 


(a) of Problem 2, page 355, to show that X” — 1 divides (X”)”"/" — 1. 


. Start by noticing that 


x2” = | - xe"! re Dox"! - 1) 


qm —2 qm —2 


ve (x2 4 Nix of (x : 1 ae. 


so that X2" + 1 divides X2” — 1. 


. Witha = -2 in Example 14.15, we get 


b b 14 b 
f@Q)=(k+ 7 aX) nad (-2) = (aX + bya q(X) + f (-) 
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5. Write f(X) = (X + 2)qi(X) — 1 = (X — 2)q2(X) + 3, with gi, go € OLX]. 
Then, write qi)(X) = (X — 2)q(X) +r, withr € Q, to get f(X) = Co 
4)q(X) + (X + 2)r — 1. Finally, use the uniqueness part of the division algorithm 
to conclude that r = 1. 

6. Write f(X) = (X + 1)(X? + 1)q(X) + (aX? + bX +0), for some a, b,c € R. 
Then, show that the remainders of the divisions of aX? + bX +c by X + | and 
X? + 1 are respectively equal to a — b + c and bX + (c — a). Finally, apply 
the uniqueness part of the division algorithm twice to obtain a linear system of 
equations in a, b and c. 

7. Assume, without loss of generality, that m > n. Follow the division algorithm 
to obtain the quotient X”~" — X™~2" 4... 4+ (—1)¢-!X"+" and the remainder 
(—1)7X" + 1. Alternatively, if g is odd write 


X™ 41 =X" + 1 = (X" + 1)X" + (-X" +1) 


and factor X"7 + 1 = (X")?7 + 1; if q is even, write 


X™ 415 X77 4 1 = (X" — 1)X" + (X7 +1) 


and factor X"4 — | = (X2")4@/2 — | 


Section 15.1 


1. Firstly, show that it suffices to consider the case in which f is of the form 
f(X) = aX", for some a € K \ {0}, and g is nonconstant. Now, in order to 
show that fog og = f 0%, use Corollary 15.14. 

2. For item (a), use the binomial formula; for (b), write f(X) = cy»X” +--+ 
c,X + co and use (a) to infer the existence of rational numbers A and B such 
that f(a + b./r) = A+ B,/r. Then, apply the result of problem 1.3.3 of [8]. 

3. Use the result of the previous problem to conclude that the given polynomial is 
divisible by X* — 2X — 1. 

4. Use the division algorithm to write f(X) = as + b)q(X) +r, for some 
constant r. Then, perform the substitution X b> —2, 


5. Start by observing that, if such an f does exist, ies f(y)? = 1— y? for every 
0 < y < 1 and, hence, for every y € R. Then, conclude that 0f = 1 and arrive 
at a contradiction. 

6. Use the root test for +7, together with the first de Moivre formula. 

7. Ifn > 1, write X"+! — x" 4-1 = X"-!(x2- X +1) —(X""!— 1) to conclude 
that it is necessary and sufficient to have X”~! — 1 divisible by X? — X + 1. 
Since X*—X+1 = (X+o)(X +o”), withw # —1 being acubic root of 1, we 
are searching for the naturals n > 1 such that (—w)"~! = 1 and (—w?)""! = 1. 

8. Assume that the given polynomial has an integer root, say r, so that 


r4 — 1994r3 + (1993 + m)r? — 1Ir +m =0. 
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10. 


11. 


12. 


13. 


14. 
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Look at such an equality modulo 2 to conclude that both m and r are even. Now, 
suppose that a and b are distinct integer roots of the polynomial. Termwise 
subtract the equalities 


a* — 1994a? + (1993 + m)a” — 1la+m=0 
and 


b* — 1994b? + (1993 + m)b? — 11lb +m =0, 


and factor a — b out of the result to obtain 
(a + b)(a? + b*) — 1994(a? + ab + b*) + (1993 + m)(a + b) = 11. 


Then, reach a contradiction. 


| Write X2-— X —1 = (X—w(X — v), with = 1495 andy = 1S. 


Invoke the root test to conclude that it suffices to find all a,b € R such that 
au? +bu!®©+1 =O andav!7+bv!©+1 = 0. Multiply the first relation by u!®, 


16 yl6 


u—-v 
a a , use (3.4) to conclude that (x,)n>1 is the Fibonacci sequence. Then, 


conclude that a = 987 and compute the value of b in an analogous way. 

Write p(X) = X(X —a2)...(X —ay), with a2, ..., @, being pairwise distinct 
nonzero integers. For d € Z, show that p(p(d)) = 0 if and only if d = a; 
for some | < i <n or p(d) = a; for some 2 < i < n. In this last case, we 
have d 4 0; moreover, writing @ = @;, show that there exists g € Z* such that 
d(d — a)q = a. Conclude that dq + 1 = —1 and, hence, that d = §. Finally, 
use this to deduce that (d — a2)... (d — a@,) = 2, and use the fact that n > 4 to 
reach a contradiction. 

Up to reordering, there are two distinct cases: (i) f(p1) = f(p2) = f(p3) = 3 
and f (ps) = —3; (ii) f(p1) = f (pr) = 3 and f(p3) = f (ps) = —3. In case 
(i), show that f(X) = a(X — pi)(X — p2)(X — p3) + 3 and, then, use the 
condition f(p4) = —3, together with the fact that the p;’s are distinct primes, 
to reach a contradiction. In case (ii), show that f(X) = a(X — p1)(X — p2)(X 
q) +3 and compute f (0) to conclude that gq = ree then, use the conditions 
Ff (p3) = f (pa) = —3 to reach a contradiction. 

Use the root test to conclude that p(X) = 5+ (X — a)(X — b)(X — c)(X 
d)q(X) for some g € Z[X]. Then, assume that there exists m € Z as in the 
statement and let x = m in the corresponding polynomial functions to reach a 
contradiction. 

If w = cis ar then w and w” are cubic roots of unity and are the roots of 
X? + X + 1. Use these two facts to find out the natural numbers k for which 
wk +1 + (w+ 1)%* = 0 and (w2)* +14 (a? + 1)*%* =0. 

Start by defining f(X) = ~"_, X% and g(X) = )7_, X”, and computing 


the second by v!® and subtract the results to obtain a = 


. Now, letting 


F(X)? = fF) AVE XG and g(X)? = g(X?) +2 V0 XM, 


i<j i<j 
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Then, use the stated conditions to show that | is a root of f — g and (f(X) + 
g(X))(f(X)—g(X)) = f(X7)—g(X7). Writing f (X)—g(X) = (X-1)F A(X), 
with k € N andh € Z[X] such that h(1) 4 0, obtain f(X7) — g(X?) = 
(X2 — 1)‘h(X2) and, hence, 


(F(X) + g(X))A(X) = (X +: 1)FA(X?). 


Finally, evaluate both sides at x = 1 to getn = 2*-!. 


Section 15.2 


1. Were a an integer root of f, show that the fact that f (0) is odd, together with the 
rational roots test, would give a to be odd. Then, show that if a is odd, then f(a) 
and f (1) have distinct parities, so that f(a) would be odd and, hence, we could 
not have f(a) = 0. 

2. Letting r be the common difference of the arithmetic progression, write x = 
y —r,z=y +r and conclude that the polynomial X° — 10X*+ — 20X? — 2 has 
the rational root x /r. Then, apply the test for rational roots to get a contradiction. 

3. Since {2cos0; 6 € R} = [-2, 2], which is an infinite set, Corollary 15.10 
guarantees that there is at most one polynomial f,, € R[X] satisfying the stated 
condition. 

4. Start by observing that the desired property holds if and only if z” fn(z + 1) = 
2°" 41 for every z € C\ {0}. However, letting f,(X) = Yi=0 aj X/, the equality 
above is equivalent to the polynomial equality Vi=0 ajX mJ (Nee Tl a NO 
1. Thus, it suffices to establish the desired equality for at least n + 1 distinct 
complex values of z. To this end, letting z = cis@, we get t = cis (—@) and, 
hence, 


1 1 
fn(Zt+ -) = fn(2cos 0) = 2cos(nO) = cis (nO) + cis (—nO) = z" + = 
Zz Zz 


5. For item (a), note that one also has x = 2y? — 1, and hence 2(y? —x)=x- y. 
Thus, either x = y or 2(y+x) = —1, whence 2x*—x—1 = Oor4x?4+2x—1 =0. 
For item (b), if x = cos 0, then 2x2— 1 = cos(26) and 2(2x2—1)?—1 = cos(46). 
Hence, we are left to solving cos(49) = cos@, and basic Trigonometry gives 
x = cos@ = cosQ, cos (22), cos (3) or cos (27). Finally, compare these value 
with those found in item (a) to get cos (4) = HS 

6. Leta = Va+ Vb+Va — Vb. Use the fact that (w+-v)? = u3+03+3uv(u+v) 
to compute a? = 2a+3/a2 — ba. For item (a), write (a?—2a)? = 27(a2—b)a3 
and apply the rational roots test to the polynomial X? — 6a X° — (15a”—27b) X3— 
8a°. For item (b), write a? — b = n°, with n € Z, to get a? — 3na — 2a = 0. 
In order to get the result of i., search rational roots in the polynomial X* — 
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3nX — 2a. For ii., first write a = 2m, with m € Z, to get a = 4m? — 3mn 
and b = a* — n>? = (4m? — 3mn)* — n°. Now, write a = 2m + 1 to get 
2a = (2m + 1)(4m? + 4m + 1 — 3n); hence, n must be even, say n = 2n' +1, 
with n’ € Z, so thata = (2m + 1)(2m? + 2m — 3n’) and b = a*—n? = 
[(2m + 1)(2m? + 2m — 3n’)]? — 8(n’)?. 

7. If (2 +i)" = (2 — i)", show that (3 + i)" = 1, so that 2 + 3/ = w* for some 


5 


: : + 2. 2k 3 
integer | < k <n, with m = cis =F. Conclude that cos =" = = and apply the 


“na 
result of Theorem 15.21. 
8. For item (a), let f,(X) be the product of all 2” factors of the form 


X= (4 Vat V@t--+ Va). 


Then, f, is certainly monic and has the given number as one of its roots. We 
make induction on n to show that f, € Z[X], noting that this is a straightforward 
computation for n = 1. Assuming its validity forn = k, let a), a2, ..., Ak, A+ 
be given as in the statement of the problem. Then, 


fer (X) =] ] (X+ Jae Jay £ +++ + Jax) 
[| (* - Va + Jay +--+ + Jax) 
= fi(X + Jacyi) fe(X — Vacs), 


with the above products ranging over all 2 possible choices of + and — signs. 
Now, direct computation gives 


fe(X = Sani) = 8x(X) + Sagyihy(X), 


for some gx, hk € Z[X], with gg monic, dg, = 2* and dhy < 2* — 1. Thus, we 
obtain 


fice (X) = (ge(X) + JSaeihe(X)) (ge(X) — aerihy(X)) 
= gy (X)* — ayyihg(X)’, 


and it is now clear that f,1 has integer coefficients. For item (c), the rational 
roots test, together with (a), shows that the given number is either an integer 
or an irrational. However, the estimates 1.4 < J2 < 1.5, 1.7 < V3 < 1.8, 
2.2 < J/5 < 2.3, 2.6 < J/7 < 2.7 and 3.3 < VII < 3.4 give 11.2 < J/2+ 
JV34+V754+J/7+ V711 < 11.7, so that it is not an integer. 

9. On the contrary, suppose the two polynomials have a common root z, which is 
not rational (for X° — X — 1 has no rational root). Then, 
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z+1 =2(2*)? = 2(—az — by? = z(a*z* + 2abz + b*) 


(—az — b)a*z + 2ab(—az — b) + b*z 


a>(—az — b) — 3a*bz — 2ab? + bz, 


so that (1 — a* + 3a2b — b?)z = a*b — 2ab* — 1. However, since a, b € Q but 
z¢Q, this gives 


at —3a*b + b* = a°b — 2ab* = 1. (22.5) 


ee 3p— _ : 
The second equality gives a 4 0, hence b* = opt Substituting such a relation 


in the first equality, one gets a+ — 3a7b + Lal = 1, thenb= 22a Putting 
this back into the second equality of (22.5), we finally obtain 


(ee ; =. , 
a — 
“ 5a3 5a3 


This is equivalent to a 10 440° 11a? 4a" 4g 1 = 0, which is an absurd, for 
the rational root test assures that the polynomial X!°+3xX°—11X°—4x?—4x-1 
has no rational root. 


Section 15.3 


1. Show that the deduction of Bhaskara’s formula (cf. Section 2.3 of [8], for 
instance) still holds in the realm of complex numbers. 

2. For the first part, the roots of w= us are ug, Upw and ugw~; now, review the 
discussion of Example 15.22. For the second part, if up + vo = ugw + vow, 
show that uo + vo(@ + 1) = O and, hence, that up = a(w + 1). Then, use 
the fact that u§ + bug — a> = 0 to obtain the relation bug(@ + 1) = 2a”. 
Finally, square it and substitute us = a(w+ 1) to arrive at 4a7 + b? = 0. Argue 
in an analogous way to investigate the equalities uo + v9 = ugw* + vow and 
ugw + vow = ugw~ + voo. 

3. Item (b) follows from applying elementary algebra to the equality of item (a), 
together with the fact that z4 + az? + bz* +.cz+d = 0. For (d), Example 15.22 
shows that one can always find w as in the first part of item (c), and hence solve 
the quadratic equations displayed in the second part of item (c). Upon doing 
this, show that z will be a complex root of X* + aX? + bX* +.cX +d. 

4. For the first part, write f(z) = anz” +--+ +a z+ ao, with ao, aj,...,d, ER 
and use items (b) and (c) of Lemma 13.2 to conjugate both sides of the equality 
f (2 = 0. For the second, make induction on the degree of f; for the induction 
step, note that, for z € C\R, one has (X — z)(X —Z) = X*—2Re(z)X + |z|? € 
R[X]. 
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. Use the result of the previous Problem 4. 
. Use the proof of the previous problem to write f(X) = u(X)v(X), with 


u(X) = []h_\(X — <j) and v(X) = []j_,(X — 2)) for some zi, ..., 
ze € C\R. Now, note that if u(X) = g(X) — ih(X) for some g,h € R[X], 
then v(X) = g(X) +ih(X). 


. Writing f(X) = a,X" + Gn) X"—! +--+ +a,X +o, with ay + 0, it follows 


from f(a) = 0 that 


an-1 = a| ao 
eo = n -Q” 1 oe a ; 
an an an 
Now, argue by induction to prove (15.8) for m > n. For the case m = —1, 


expand the equality a~! f(w) = 0 in terms of the coefficients of f; for the 
general case m < 0, use induction again. 


. Apply the triangle inequality to get, for |z| > 1, 


lanllzl" < lan—allzl"! +--+ + laillzl + lao] <n Alzi"! <nAlz|"!. 


. Letting z = a+ fi be acomplex root of f such that a > 2, adapt the hint given 


to the previous problem (i.e., apply the triangle inequality) to obtain |a,z” + 

n-1 
dn Z| < co . Subsequently, use this to show that |z| < 1+ eee 
and, then, substitute z = a + if. 


For the case of degree 4, if z € C is a root of f, then z #0 anda (2 + +) + 


b (z + 1) +c = 0; then, let w = z+ i and apply the result of Problem 4, 
page 378, for n = 2. For the case of degree 6, argue analogously, starting with 
f(X) =aX® + bX> +cX44+dX3 4+ cX? + bX +4, witha £0. 

Use the stated conditions to get (X + 1) f(X) — X =aX(X —- 1)...(X —n). 
Then, compute a by evaluating both sides at —1. 

As in Example 15.26, substitute X by | in the factorised form of X"~!+X"~-?4 
+--+ X + 1; then, use some Trigonometry to show that, if @ = cis an then 
[1 — o*| = 2sin™ forl <k <n. 

Use the result of the previous problem, together with the relation sin(z — x) = 
sin x. 

If z € Cis a root of p, show that z* and z — 1 are also roots. Then, use the 
finiteness of the number of roots to successively conclude that |z| = 1, |z—1] = 
land z = wor z = @, where w = cis a Finally, apply the result of Problem 4. 


Section 15.4 


if 


Use the result of Corollary 15.32, pretty much along the same lines of 
Example 15.33, to show that 8a? — 25a” — 180a + 608 = 0. Then, conclude 
that a = 4. 
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Write f(X) = g(X)7h(X) and, then, compute f’. 

Use item (b) of Proposition 15.29 to show that f’(z) = ae =i za e 
mT 

Adapt, to the present case, the hint given to the previous problem. 


For item (a), let f(X) = X" -—1L= nearness —q/) and check that 


(wf —1)...(@! — ow")! — w/t)... (af — @"!) = f'(@!) = no, 


For item (b), note first that the substitution X te O in (15.7) gives 
ww ...a@—! = (—1)""!n. Then, 


n—-1 


P a OE =T..te a0 Ser? =")... a) 


n—1 


= [ [co *@ _ 1) —_ io” _ ok!) (ok _ ory . (ok = re ame 


k=0 


n—-1 
n(n—1) _ (n=1)(n=2) 
=(-1) 2 [[no®™* =(-1) 7 nt". 
k=0 


. For item (a), set b, = Aa”! + BB"-! + Cy"! for every n > 1, with A, B 


and C being the solutions of the linear system 


A+B+C=a 
a? A+ PPB + y2C =43 


show that (by ),>1 satisfies by43 = ubp42 + vbg41 + wh, for every k > 1, and 
that a, = by for 1 < k < 3. Then, show that a, = b, for every n > 1. Items 
(b) and (c) can be dealt with in similar ways, except for the fact that, in order to 
establish the equality by43 = ubg42 + vby+1 + wh, you will have to use the 
fact that X? — uX? — vX — w has multiple roots. 


. First of all, find the roots of X? — 6X? + 12X — 8 (look for integer roots). Then, 


apply the result of the previous problem. 


. Letting f(X) = TT, (1 * tx), show that f(1) =n + 1and 


f(M=1+ YO x7, 
O#SCIy = 


Now, compute f’(1) and, with the aid of the result of Problem 4, f aoe 


. It suffices to show that, if some half-plane of the complex plane’ contig the 


roots of f, then it also contains those of f’. By the sake of contradiction, 


612 
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11. 
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suppose that there exists a line r such that one of the half-planes it determines 
contains a root w of f’, whereas the other contains all of the roots of f. Let 
u € C have modulus | and such that the vector u is perpendicular to r; if 
f(X) = a(X — 21)... (X — Zn) and @ and 6; respectively denote arguments of 
u and z; — w, use the result of Problem 3 to show that 


n 
Re | ) > |zj — wl! cos(@; — 6) | =0. 
j=l 


Then, note that (z; — w)/u has an argument belonging to the interval (— 4, 4), 
and use the equality above to reach a contradiction. 

tf) = Ye cx. X*, with cy € Z, then f(X) = an ck(a + (XxX — a))*. 
Expand all of the binomials (a + (X — a))* and compare the result with (15.9), 
for z= a. 

Make induction on k > 1. For the induction step, if mz € N is such that 
f (mg) = O(mod p*) and f’(mz) # 0 (mod p), let me, = me + xp*, with 
x € Z to be found. Then, use Taylor’s formula (15.9), together with the result 
of the previous problem, to show that 


f(me+1) = f (me) + f (mp) xp* (mod p**!). 


Subsequently, show that it is possible to choose x so that f(mz41) = 
0 (mod or), Finally, since mg4, = mz (mod p) and f’ € Z[X], we have 
f' (mest) = f' (mg) # 0 (mod p). 

Start by showing that one can assume that the center of the square is the origin 
of the cartesian plane. Then, if (u, v) is one of its vertices, note that the other 
ones are (—v, u), (—u, —v) and (v, —u). Substitute these points in the defining 
equation for the graph, ie., y = f (x), to obtain au? +c = 0, v =u? + bu and 
av’ +c = 0,u = —v°—bv. Conclude that a(u — v)(u+v) = 0 and, hence, that 
either a = c = 0 oru = £v and au + c = 0. Discard the possibility u = +v 
and successively obtain a = c = 0 and u2(u2 + b)> + bu? +b) +1 = 0. 
Conclude that the polynomial be Oe + b)3 + b(X? +b)+ 1 = 0 has only two 
real roots, so that X (X + b)3 +b(X +b)+1 = Ohas only one real root a = u- 
(apart from multiplicities). Thus, show that such a root must be a multiple one, 
and therefore that (X + b)3 +3X(X+ b)? +b = 0. Letting A = a+), perform 
some simple algebraic manipulations to arrive at (16 — 2b”)bA = 2b? — 16, so 
that b> = 8 or bA = —1. Discard the possibility bA = —1 and conclude that, 
if b* = 8, then (bA)? — 4(bA) — 8 = 0. Finally, if 2 stands for the length of the 
sides of the square, show that 
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= 2(u? +0? (uv? + b)’) = 2(A — b)(A? + 1) 
_ 2bA—b?)((bA)? +b?) — 2(bA — 8)((BA)* + 8) 
7 b 7 8b 


2 = 
= (DA) —e) 32 _ a. 


Section 16.1 


1. Adapt, to the present case, the proof of Proposition 16.1. 
2. For item (a), adapt the argument of the solution of Example 16.3. For item (b), 
start by writing 


(CaF = 2 He = ae = Fe -  , 


with f of degree 2. Then, use the equality —y> + (y —zP +2 = —yz(y — 
z) f (0, y, z) to show that f(0, Y, Z) = 5(Y* — YZ + Z?) and, analogously, that 
f(X, 0, Z) = 5(X? — XZ + Z”) and f (X, Y, 0) = 5(X? — XY + Y”). Finally, 
show that 


FOE 2) SSO PS RY aT = FZ). 


3. Letting f = f(X, Y, Z) stand for the given polynomial, adapt the reasoning used 
in the solution of Example 16.3 to obtain 


F(X, Y, Z) = (X + Y)(X4 Z)(V + Z)g(X, Y, Z), 


with g € R[X, Y, Z] of degree 2. Now, use the equality f (0, y, z) = yzg(0, y, z) 
to show that g(0, Y, Z) = Y°+YZ+Z? and, likewise, g(X, 0, Z) = X7+XZ+ 
Z? and g(X, Y,0) = X*+ XY 4+ Y?. Finally, obtain 


XY, Zak 4 4 FRY HAZY TZ. 


Section 16.2 


1. Use the symmetry of f and g to show that, for a fixed permutation o of I, 
we have h(x),...,X%,) = A(Xo(1),---,Xo(n)) for infinitely many elements 
X1,...,4Xn, € K. Then, use the result of Problem 1, page 398, as well as that of 
Proposition 16.1. 

3. For item (a), apply the definition twice. For (b), just note that all factors X; — X ;, 
for | <i < j <n, will appear in f° as +(X; — X;). For (c), use the result of 
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(a). Item (d) follows straightforwardly from (c). For item (e), and writing t in 
place of ty; for the sake of simplicity, we have 


f= [I w— Xe) 
l<i<j<n 
= [] ®w- XW): T] Xm - Xr): 
lsi<j<n k<jAl 
i jek 
[[ Xo — Xe) T] Xcw -— Xr): 
l<j i<k 
-T] (Xr@ — Xm) + (Xr — Xe) 
k#i<l 


Finally, for (f), show that the map o +> 0 07}? is a bijection from Ay to S;,\ An. 

4. If S, stands for the set of all permutations of J, and t € S,, use the fact that 
o +> to defines a bijection on S,. 

5. Adapt the argument given to the proof of Example 16.7. 

6. Letting e+bh+ea= k, we have a> + 3a” = 3k — 25, so that a is a root 
of f(X) = X3 + 3X? 4+ (25 — 3k); analogously,b and c are roots of such a 
polynomial. Now, factorise f and use Girard-Viéte relations to conclude that 
a+b+c=-—3and ab +ac + bc = 0, and hence that a2 + b* + c? = 9. 

7. Letting z1, Z2, z3 be the complex roots of the given polynomial, we want to find 
the coefficients of f (X) = (X — z})(X —z53)(X — z3) in terms of a, b and c. To 
this end, use Girard-Viéte relations, together with the result of Example 16.3. 
For instance, letting g(X) = X3 + aX” + bX +c, we have g(X) = (X - 
Z1)(X — z2)(X — z3) and, hence, 


a+at re = (21 + 22 +23)? — 3(z1 + za) (zi + 23) (Za + 23) 


= (-a)? — 3(—a — z3)(—a — 22)(—a — 21) 


a> — 3g(—a) = —a? + 3ab — 3c. 


8. First of all, show that if a, b and c are such roots, then the hypothesis of a, b and 
c being positive assures that a, b and c are the lengths of the sides of a triangle if 
and only if (a+b—c)(a+c—b)(b+c—a) > 0. Then, substitutea+b+c = —p 
at the left hand side, expand it and use the fact that ab + ac + be = q and 
abc = -r. 

9, If f(X) = (X — a)(X — b)(X — oc), then f (X) = X? — p, with p = abc #0. 
Now, argue as in the hint given to Problem 7, page 386. 

10. Apply the result of Example 16.7 to the polynomial g(X) = X! 4 2x 4 

3X°8 +... + aogX? + aogX + ajog. Then, show that its roots are the inverses 
of the roots of f. 


22 


11. 


12. 


13. 


14. 


15. 


16. 


17. 
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Letting ax + by = c denote the equation of a straightline satisfying the stated 
conditions, substitute y = —2x — £ in the equation defining the graph of f 
and, then, use Girard-Viéte relations. 

Let (x — a)? + (y - b)* = R* be the equation of the circle. Substitute y = 1 
and, then, use Girard-Viéte relations. 

For item (a), apply Corollary 15.32. For (b), use the fact that a7 = a” +a+1, 
and analogously for b and c. Finally, for (c) use (b) and Girard-Viéte relations. 


For item (a), use induction. For item (b), apply the result of (a), noticing that 


f (KX) =X" + 9X") $F sg XP bet 5p 1X + Sn. 


Make induction on n > 2 to conclude that xj = x2 =... = x, = 1. For the 
induction step, if 


(XK) = (KX — x1) (XK — x2)-+- (KX — xn) 


=X” gy XO ee hye tag, 
note that 


O = f(x) + fa) +-+>+ FOn) 
=n+ay,—in+---+tajn+aon =nf (1). 


Firstly, show that the roots of f are negative. Then, apply Girard-Viéte 
relations, together with the inequality between the arithmetic and geometric 
means, to deduce that a, > ({) forl<k<n—1. 

Letting x}, ..., x, denote the roots of f, use Girard-Viéte relations to conclude 
that 


n 


n 
Sn — (Sou) -2 > xjxj =3 
i=1 


i=l l<i<j<n 


and [T/_, x = |. Then, apply the inequality between the arithmetic and 
geometric means to show that n < 3. Finally, separately consider each of 
the cases n = 1, 2 and 3; in the case n = 3, you may wish to use some 
Calculus (namely, the study of the first variation and/or the intermediate value 
theorem) to find out which of the possible polynomials have only real roots. In 
this respect, see [8] or the material of Sect. 17.1. 


Section 16.3 


1; 


For 1 < k <n, use Newton’s theorem and Girard-Viéte relations. For k > n+ 1, 
use item (a) of Proposition 16.17. 
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2. Use item (b) of Proposition 16.17 to prove by induction on j that, for 1 < j <n, 
the j-th elementary symmetric sums of aj,..., ad, and bj, ..., by, coincide. Then, 
compare the coefficients of the polynomials [';_,(X — aj) and []/;_)(X — bj). 

3. Apply item (b) of Proposition 16.17 to prove, by induction on j, that s; = (‘) 
forl <j <k. 

4. For item (a), factorise X”” — z” over C and use the result to factorise g over C. 
For (b), use Newton’s theorem to show that the elementary symmetric sums of 
zi’, ..., Zh’ are integers. Finally, for (c), just note that g(X) = h(X”), for some 
he ZX]. 

5. For item (a), let f(X) = X" + Gn X"—-! +--. + a,X +a bea polynomial 
with integer coefficients and such that all of its complex roots have modulus 1; 
use Girard-Viéte relations to show that |a,—x| < (7) for 0 < k <n. For item (b), 
let a1, ..., @, be the complex roots of f. Use Newton’s theorem to show that, 
for each integer k > 1, the polynomial f,(X) = (X — a?) 2 (X — a2") has 
integer coefficients. Then, apply the result of (a) to assure the existence of natural 
numbers k < / such that f; = f). Finally, use such an equality to show that a; is 
a root of unity, for 1 < j <n. 


Section 17.1 


1. For each nonreal complex root z = a+ bi of f, write the factor (X — z)(X — Z) 
of f as in the proof of Lemma 17.1. Letting c be the leading coefficient of 
f, X1,-..,Xx its real roots and z1, Z1,..., 2, Z; the nonreal ones, with zj; = 
aj +ib;, we have 


k I 
f(%) =c[ [&-x)[[X- ax -%) 


j=l i=1 


k 1 
=c¢ [[« = xj) [| ((X — aj)” + b?). 


j=l i=1 


Use this way of writing f to estimate | f(m)|, and notice that | f(m)| 4 0, 1. 

2. Ifa < a < f are the roots of f closest to a, apply Bolzano’s theorem to 
intervals of the form (a, b) or (b, a) contained in the interval (a, 6). 

3. Arguing by contradiction, assume that f’(a) > O (the case f’(a) < 0 is 
completely analogous) and apply item (a) of Corollary 17.7, together with the 
result of the previous problem. 

4. Apply Bolzano’s theorem, Corollary 17.7 and the result of the previous 
problem, noticing that f(x) = 0 = x = 0 or §, and f(—1), f(3) < 0 < 
f(). 


22 Hints and Solutions 617 


5: 


10. 


11. 


12. 


13. 


Take no € N greater that the largest real roots of f and f’. Use Bolzano’s 
theorem, together with the fact that f(x) > O for sufficiently large x (cf. 
estimates analogous to those preceding (15.5)) to show that f(x) > O for 
x > ng. Then, use Corollary 17.7 to show that f(u) > f(v) foru > v > no. 


. Assume, without loss of generality, that f has positive leading coefficient. 


According to the previous problem, take ng € N such thatu > v > no > 
ft) > f(v) > 0. Let m > no be an integer for which f(m) = p, a prime 
number. If f(X) = a,X" + Gn 1X"! +--+» +a,X +o, then 
f (m+ p?) = an(p* +m)" +++» + ai(p? +m) + ap 
= pie (agm* + --»+ajm +9) 


= plt+ fim) = f(m)(f(m)l + 1). 


Now, check that the choice of m gives f(m + p*) > f(m), and conclude that 
f(m + p”) is composite. 


. Firstly, show that the given condition is equivalent to x!! — x = y!! — y, Then, 


prove that, for any c € R the polynomial f(x) = X!! — X — c has at most 
three distinct real roots; to this end, you shall need to use Corollary 17.7 and 
Problem 3, in a way analogous to that of the hint to Problem 4. 


. Since A ~ 0, show that f(A) = 1, where f(X) = (X — aj)(X — a)(X 


a3)(X — a4). Then, show that f decreases in (—oo, a1), so that f(0) = 1 gives 
A. > ay. Finally, note that if aj < 4 < ao, then f(A) < 0. 


. Let f(X) = aX* + bX3 +cX* +dX +e, and take t > 1 such that r? is a real 


root of aX*+(c—b)X +(e—d). Show that f (t) f(—t) = (bt? +d)(1—17) < 0 
and, then, apply the theorem of Bolzano. 

Set (©) = 1 @X4, (KX) = DF ja ee Sag and conclude that xg’(x) = 
F(x? > 0, for every x > 0. Then, use Corollary 17.7 to obtain g(1) > g(0) = 
0, with equality if and only if g is constant. 

Start by showing that f has three distinct real roots a < B < y, with —2 < 
a < —1,0 < B < landl <y < 2. Then, note that f(f(x)) = 0 if and only 
if f(x) =a, B or y. Finally, find out the numbers of distinct real roots of each 
one of the polynomials f(X) — a, f(X) — B and f(X) — y. 

Set g(x) = f(x) + f(x) + f’(x) +--- + f(x) and assume, by the sake 
of contradiction, that g attains negative values. Then, f 4 0 and the condition 
f(x) = Oforx € Rassures that n is even and f has positive leading coefficient. 
Hence, limjx|-+ +00 f(x) = +-oo, and Weierstrass’ theorem (Teorema 8.26 of 
[8]) guarantees the existence of x9 € R such that g attains its minimum value 
at xo, with g(xo) < 0. It follows from Problem 3 that g’(xo) = 0. However, 
since g/(xo) = f’(xo) + f” (xo) + --- + f™(x0), we have 0 > g(xo) = 
f (xo) + g’(xo) = f (x0) = 0, which is a contradiction. 

Firstly, show that B can play in such a way that, when exactly three coefficients 
are left to be chosen, at least two of them are coefficients of powers X", with 
an odd r. Then, after A plays, we shall have f(X) = g(X) + a,X* + a, X!, 
with g being a completely determined real polynomial, 1 < k,/ < 2n—1 being 
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14. 


15. 


distinct integers such that a, and q are still left to be chosen and / odd. Since 
fQ) = g2)+2* a, +2! a) and f(—1) = g(—1) + (—1)* ax — a, we shall have 
f(2) +2! f(-1) = g(2) +2! g(-1) + (2* + (—1)*2!)ag, so that B can play by 
_ g(2)-+2! g(-1) 
2+ (kal 

the existence of at least one real root for f, no matter how A plays next. 
Without loss of generality, we can assume that the leading coefficient of the 
polynomial f originally written in the blackboard is positive. Let a be the 
smallest and £ be the greatest real root of f (possibly with a = £). Ifa < B, 
show that f’(a) > O or f’(B) > 0; if f’(B) > 0 (the case f’(a~) > 0 can be 
dealt with analogously), conclude that the next polynomial to be written either 
has no root in the interval [6, +00) or has a root in the interval (6, +00). If 
a = B, conclude that f(X) = (X — a)? and show that both f + f’ and f — f’ 
fall into the first case. 

Setting b; = —a; for 1 <i <n, show that the condition f(x) > 1 is equivalent 


to PS < 0, with q(X) = []7_y(X — bj) and 


choosing ax = . Finally, conclude that Bolzano’s theorem assures 


p(X) =] [(x-4)+ > aL] «x -5) 
i=1 


i=l j#i 


Show that p has a root x; in the interval (bj, +00); then, use Bolzano’s 
theorem to show that p also has a root x; in each of the intervals (b;, bi-1), 


for 2 < i < n. By distinguishing the cases n even and n odd, show that 
the sum of the lengths of the solution intervals of the inequality oe < 0 


. [oie Xi — c/_, bi|. Finally, apply Girard-Viéte relations to show that 
j=| Xi =O 


Section 17.2 


. Firstly, use the definition of the derivative of a polynomial to show that if g,h € 


R[X] are such that g’ = h’, then there exists c € R for which f = g +c. Now, 
letting f’(X) = a(X +b)"~|, show that f(X) = ¢(X +b)" +c, for some c € R. 
Finally, use the hypothesis on f to conclude that f(X) = (X + a)”, for some 
aeéeR. 


. For the sake of contradiction, assume that the given polynomial has only real 


roots. Then, use Newton’s inequalities, together with Girard-Viéte relations, to 
conclude that 3a” > 8b, 4b? > 9ac and 3c” > 8bd. 


. Writing H; = Hj(a, b,c, d), the given inequality is equivalent to Hy Hz > Hs. 


F3 


Now, it follows from Newton’s inequalities that H, > me 2a, 
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Section 17.3 


. First of all, note that V( f) = V(f(—X)) <n. Then, apply Descartes’ rule twice 


to conclude that V(f) + V(f(—X)) = 2h. 


. Example 17.17 assures that f has precisely one positive root. It now suffices 


to apply Bolzano’s theorem to show that f (2 - 


st) and f (2 - mr) have 
distinct signs. To this end, letting f denote the given polynomial and ace )= 
(XxX — a note that g(X) = X"+! — 2x" + 1; then, use that 1 < 2— ot < 


2- oa to show that it suffices to prove that g (2 - st) and g (2 - mr) have 
distinct signs. 


. If f (X) = aX? +bX? +cX +d, we want to compute the number of real roots of 


g =2ff" — (f’)’. Assume, without loss of generality, that a = 1. Moreover, if 
a < B < y are the roots of f, show that changing g(X) by h(X) = g(X+8), we 
can assume (also without loss of generality) that 6 = 0 and, hence, that c < 0 
and d = 0. Under such simplifications, an immediate computation furnishes 
g(X) = 3X4 + 4bX? + 6cX? — c?. It now suffices to apply Descartes’ rule to 
conclude that g has exactly one positive root and exactly one negative root. 


Section 18.1 


NM B&B W 


. For item (i), use the fact that df; = j to conclude that a, = b,; then, argue by 


induction. For (ii), take n = Of and argue by induction on n; for the induction 
step, start by choosing a, to be equal to the leading coefficient of f. 


. Fork = 0 and k = 1 the result is obvious. For k > 2, show that () (x) = Oif 


O<x <k—1, (*)(x) = (7) ifx > kand (%)@) = (-DF(* 7) ifx <0. 


. Apply the Lagrange interpolation theorem. 

. Apply the Lagrange interpolation theorem. 

. Adapt, to the po case, the proof of Proposition 18.6. 

_Let wo = cis™. For 1 < k < n—1, substitute x = o* into the 


polynomial factions corresponding to p and to the p;’s, and use the result of 
Proposition 18.6. 


. For item (a), look at the sum at the right hand side as the result of a Lagrange 


interpolation. Item (b) follows directly from the factorisation obtained in (a). 
Finally, for item (c), start by writing 


2 es). bE -4rycr vi(" Vas 


k>1 a k>1 j=0 


1 


Then, for a fixed / > n, show that } appears in the last sum above with coefficient 


Yj-0(-1)/ 6) = 


. For each prime p in the set A = {3,5,7, 11, 13, 17}, let wa» denote the common 


value of the sums ax + dg+p + 4k+2p + --- when k varies from 1 to p, and 
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Op = cis =, If f(X) = asopX™ + --- + .a2X* +a, X, apply the general version 
of the multisection formula (cf. Problem 8, page 355, or Example 18.7) for r = 
0,1,..., p — 1 in order to obtain a Vandermonde system of equations in the p 


unknowns f (@}), for 0 < j < p—1. Then, use the result of Proposition 18.6 to 
conclude that the solution of such a system is f(1) = pa, and f(wp) =--- = 


f (we ') = 0, thus obtaining p — | distinct and nonzero roots for f. Finally, 
notice that Dipeal? — 1) =50 and that 0 is also a root of f to obtain f = 0. 


Section 18.2 


2. Let us make induction on k > 1, the case k = 1 being immediate. By induction 
hypothesis, assume that the formula holds for a certain k € N. For k + 1, we 
have: 


F@+KEDA) = fe +kh) + (ALP@ +kh) 


k 
k 24 
= f(x t+kh+>- (‘oat ALA) 


j=0 


k 


at ‘cat “p+ ¥( Jar ! f(x) 
j=0 


k 
He oats Fy) + AR AG 


ja0 
ue iar Pe: 


Now, by performing a change of indices in the last sum above and applying 
Stifel’s relation, we obtain f(x + (k + 1)h) successively equal to 


k k-1 


k ; 
ae (SJar ya) + (ARH f)(x) + » Ge Jarine 
j=0 
k+l — { (t k (k++) 
=A nw +d ((j + Gi) J+) Ayn) + (AP A@) 
j=0 
k+1 


k+1 
->(5 . Jar ‘ f(x). 


j=0 


22 Hints and Solutions 621 


3. Adapt, to the present case, the solutions of Examples 18.15 and 18.15. 
4, Since df =n, Proposition 18.14 assures that A”*+! f = 0. Hence, it follows from 
item (e) of Proposition 18.13 that 


n+1 


0 =(A™' Yo = Ye" ’ ') fa +1-j) 
j=0 
n+l 
= Yen") ') se) 
j=l J (a) 
n+l 
= Di(-bi + ft, 
j=l 
so that 
wf. 0, ifn =1(mod2) 
fa+)=-)ICd! = 1, ifn =0(mod2) * 


j=l 
5. Use item (e) of Proposition 18.13 to get the equality 


991 ae j (991 . 
0 = (Aq” f)(992) = D(-D j ) rags =a: 


j=0 
Then, use the fact that, for every n € N, we have 


n 


n 
> (") Fon41—j = Fanti — 1 


j=l 


(cf. problem 18 of Section 4.3 of [8], for instance). 
6. The given conditions assure that (At f) (0) > OforO0 < k < 3 and (At f)(n) >0 
for each n € N. Now show that, for every m € N, we have 


n-1 


AT A@ =VAT/A&+ AT! /O. 


k=0 


Finally, use this formula to successively show that ee fom) > 0, (At f)(™) > 
0, (Ai f)(@) > Oand f(n) = (AD f)(n) > 0, for everyn EN. 
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Section 19.1 


2. 


NOs WwW 


Firstly, show that pe 8. pe divides both f and g in Q[X]. Secondly, show that 


if h € QLX]\ {0} is monic and divides f in Q(X], thenh = p°!... p*q,!...q)!, 
with 0 < 6; <a; forl <i <k,and0 <5) <a forl <i <1. 


. Adapt, to the present case, the proof of Euclid’s algorithm for integers. 

. Apply Euclid’s algorithm to find X* — 3 as the answer. 

. Argue by contraposition. 

. Start by using Corollary 15.24, together with the result of Problem 4, page 386, 


along the same lines of item (b) of Example 19.7. 


. Compute derivatives to show that f”~! f’ + g”?~!g’ = h"~'h’; then, multiply 


by / on both sides to obtain (f’h — fh’) f"~! = (gh' — g/h)g"—!. Use the fact 
that f and h have no nontrivial common factors, together with Theorem 19.2 
and the result of the previous problem to show that f’h — fh’ 4 0; accordingly, 
show that gh’ — g/h # 0. Assume, without loss of generality, that df > dg. Then, 
apply again the result of the previous problem to the equality (f’h— fh’) f”~! = 
(gh' — g'h)g"—!, conclude that f"~! | (gh’ — g/h) in R[X]. Finally, use this last 
relation, together with f” + g” = h”, to obtain the following degree estimates 


(n — laf <dg+dh—1< df +2ag—1 


and conclude the solution of the problem. 


. For item (a), make induction on k, using the result of Corollary 19.4 to show that 


there exist fis Jk. © KLX] such that 


_ Z a 


e Sek ee 


with fj = Oor Of, < A(gi!...gths') and fe = 0 or dfe < A(g¢"). For 
item (b), start by dividing f by g*, thus getting f = gk +r, with r = 0 or 
O0<or< a(g*); then, divide r by g‘~! and proceed inductively. 


Section 19.2 


1. 


For the uniqueness part of item (b), let a,b,c,d be nonzero integers with 
gcd(a, b) = ged(c, d) = 1, and g,h € Z[X] \ Z be primitives, with (a/b)g = 
(c/d)h. Calculate contents in both sides of (ad)g = (bc)h. 


. Since f is reducible over Q, there exist monic and nonconstant polynomials 


gi,4, € Q[X] such that f = g hy. Take a,b,c,d € Z \ {0} such that 
gcd(a,b) = ged(c,d) = 1 and gi = fg, hi = Gg, with g,h € Z[X] 
monic and nonconstant. Take contents and apply Gauss’ lemma to the equality 
bdf (X) = acg(X)h(X) to conclude that bd = ac. 
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3. First of all, note that d is primitive. Now, for (a) => (bd), let f, g and d 
have canonical factorisations f = +f/'... f(", g = +g/''...g/" andd = 
hi! ...ny*. Show that if d’ | f, g in ZLX], then the canonical factorisation of d’ 


isd’ =h'!...h, with a! < a; for 1 < i < k. For (b) > (a), argue in an 
analogous way. 


Section 19.3 


1. For the verification of the listed properties, follow the corresponding verifica- 
tion for polynomials in K[X], with K = Q, R or C. 

2. Apply Fermat’s little theorem. 

3. For the first claim, note that 


f@m=0> f@=0> f@ =0. 
For the second, use the root test to write 
F(X) = (XK —4))...(X —%) F(X), 


for some nonzero g € Zp[X] with no roots in Z,. Then, perform the 
substitution X +> a and apply the second part of Problem 6, page 281 to 
conclude that, for some index 1 < j < k, we havea —aj; = 0. 

4. If a € Qis a rational root of f, the rational roots test shows that a € Z and 
a | 84. In order to eliminate some of the left possibilities, project f into Z3[X] 
and Zs[X] to conclude, with the aid of the previous problem, that a = 0 (mod 3) 
and a = 4(mod 5), and hence that a = —6, —21 ou 84. 

5. Make induction on 0/; for the induction step, apply the result of Example 6.42. 

6. Assume that fg is not primitive and choose a prime p that divides all of its 
coefficients. From the equality fg = fg = 0inZ plX], conclude that f=0 
or g = O and reach a contradiction. 


7. Leta be a primitive root modulo p. For item (a), show that (ak)4 # | (mod p), 
k(p 


for every integer 1 < k < p — 1. For (b), show that (a i )?4 = 1(mod P) for 
every integer | < k < d, and then conclude that 


d 
Maie| [asa = ). 


8. Use the result of Proposition 19.18, together with Girard-Viéte relations. 
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9. 


10. 


11. 


12. 
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Show that, modulo 17, we have 
X?— 3X7 41 = (X —4)(X —5)(X + 6). 


Then, let s, = a* + b* + ck and tq, = 4* + 5* + (—6)* and conclude, with the 
aid of the methods of Sect. 16.2, that 


Sk4+3 — 3S¢42 + 5, = 0 and #43 — 3te2 + th =0 
5, =3,52=0,53 =—-1 th =3,2=0,8h=-1— 


Finally, use the above to show that s, € Z for everyn € N ands, = 
tn (mod 17). 
For item (a), project on Z2[X] and apply the result of Problem 5 with p = 2. 
For (b), start by writing the binary representation of 2” — 1; then, project on 
Z2[X] and apply the result of Problem 5; finally, argue by induction on m. 

For the coming problem, the reader may find it useful to review the statement 
of Problem 11, page 163. 
For the proof of Lucas’ theorem, compute in Z,[X], with the aid of Exam- 
ple 19.15: 


(X +0)" = (XK 4 TP" XK + TY PX 4 TN” 
= (XP ATM (XP 4 TY (X 4 Tym 


Now, use the uniqueness of representation in base p to conclude that the 
coefficient of X” in the right hand side is exactly 


eee) 


For (a), if p | (”") then p | ) for some 0 < j < k; in turn, ifn; < mj, then 


feet ey ym) _ 
nj < mj; < p, and the fact that p is prime avoids (i) = to have a 


mj! 
prime factor p. Item, (b) follows from (a) and the uniqueness of representation 
in base p, since we ought to have 0 < nj < mj; for 0 < j < k, which gives 
exactly mj + | possibilities for n;. Finally, for item (c) note that potl—i= 
(p—1)p* +---+(p— pt (p— I), so that mp = m, =... =m, = p—1. 
Hence, from (b) there are pert binomial numbers of the form (? V) which 
are not divisible by p; thus, no binomial number is left to be divisible by p. 
First of all, note that the equality X’ —1 = (X—1)(X?~! + X?-74..-4X+4+1) 
gives, in Zp[X], 


(X— 1)? = (KX -1(XP 14 XP 7. X+D, 
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so that (X —1)?-! = xP-!4 xP? 4....+ X +1. Therefore, bi expanding the 
binomial at the left hand side and comparing coefficients, we get (—1)! (? :) = 
1 (mod p). Now, if 1 = p*(p — 1), then Lucas’ theorem gives 


hens Kip — —- 
(’ (p ») _ P a ”) = (’ ') = (—1)4 (mod p). 
l P*q q 


For what is left to do, note that item (b) of the previous problem assures that 


exactly p binomial numbers of the form e *@ a) are not divisible by p; since 
we have already found p such numbers (for g can vary from 0 to p — 1), the 
remaining ones are multiples of p. 


Section 19.4 


1. Firstly, show that the given polynomial has no integer roots. Then, invoke 
Gauss’ lemma and examine the possibility X4— X27 41 = (X27 +aX +b)(X24+ 
cX +d), witha, b,c,d € Z. 

2. Gauss’ lemma assures that it suffices to show that f cannot be written as the 
product of two nonconstant polynomials with integer coefficients. To this end, 
first note that f has no integer roots; hence, if it can be written as the product 
of two nonconstant polynomials with integer coefficients, we must have 


FS OO Fak +? ek 4 ak +e, 


for some a,b,c,d,e € Z. Expanding the products at the right hand side 
and comparing coefficients, conclude that be = 2, so that (b,e) = (1,2), 
(—1, —2), (2, 1) or (—2, —1). Then, check that each of these possibilities leads 
to a contradiction when compared with the remaining relations involving a, b, 
c,d and e. 

3. Write f(X”) = g(X)h(X), with g,h € R[X] \ R as in the statement of the 
problem andn > 1 (the case n = 1 is trivial). If g(X) = by X*+bp4,X*t!4.-- 
and h(X) = ¢X! + c41X't! + ---, with by, c) # 0, and changing g and h 
respectively by g(X) = by +bx41X +--+ and h(X) = X44, XM 
+++, We can suppose that g(0), (0) # 0. Therefore, from the beginning let 
g(X) = bp + bX +--+ and h(X) = c,X! + cy, X't! + ---, with bo, c7 # 0. 
Use the fact that f(X”) = g(X)hA(X) to show that n | 7. Then, cancel the terms 
of minimal degree at both sides of the equality f(X”) = g(X)A(X) and argue 
by induction on df to show that g(X) = g1(X”) and h(X) = h(X"), for some 
g1,h1 € R[X] \ R as prescribed in the statement of the problem 

4. Once more from Gauss’ lemma, we examine when it does happen that f = 
gh, with g,h € Z monic and nonconstant. Adapting the idea of the proof of 
Example 19.32, we have 1 = f(0) = g(O)h(0), so that g(0) = A(O) = +1; 
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hence, a1, a2, ..., dy are pairwise distinct roots of g —h. If g —h # 0, then 
0(g—h) < max{dg, dh} < df =n, so that g—h must have less than n distinct 
roots. Therefore, g — h = 0, and hence f = g’, or 


eC = 1S (2 as. AE HG), 

In particular, n = 2k for some k € N, and the unique factorisation in Z[X] 
gives, without loss of generality, 
g(X) —1= (X—a1)...(X—ay) and g(X) +1 = (X —agy1)...(X — an), 
with aj < a2 <--+ < ag and aps < ag42 <--++ < ay. Therefore, 

2 = (X — agyi)...(X — ang) — (X — ay)... (X — ag) 
and, by evaluating the equality above at a1, a2, ..., ag, respectively, we obtain 

(aj — g41)-.- (Gi — Ar) = 2 


for 1 < i < k. From this point, conclude that if k > 3 then at least two of 
the numbers az41, ag+2, ..., @2% would be equal, which is not the case. Finally, 
look separately at the cases k = 1 and k = 2 to get the listed polynomials. 


. Apply Eisenstein’s criterion in conjunction with the result of item (a) of 


Problem 8, page 469. 


. For the sake of contradiction, assume that f = gh, with g,h € Z[X] \ Z, and 


examine the equality f(X) = g(X)h(X) in Z,[X]. 


. Asin Example 19.28, it suffices to establish the irreducibility of g(X) = f(X+ 


1). To this end, start by applying several times the result of Problem 5, page 469, 
to show that, in Z,,[X], one has 


k-1 


gy a) GP 41)? a OP I. 


Then, also as in Example 19.28, use the fact that 


P P 

Y (Y +1)? = Y ( P jer = yp! 
; J— 

j=l 


j=l 


to get #(X) = xP"(e-)). Then, look at the constant coefficient of g(X) = 
Ff (X + 1) to conclude that 


g(X)= x + m1 X™! +:--+a,X + p, 


with m = pi (p —1) and aj, a2,..., 4m—1 € Z being multiples of p. Finally, 
apply Eisenstein’s criterion. 


. Thanks to Gauss’ lemma, we only need to show that f is irreducible in Z[X]. 


By contradiction, assume we have f = gh, with g,h € Z[X] \ Z and (with no 


22 
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12. 


. Let z,.. 
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loss of generality, since f(0) = p) g(0) = +1. Use Girard-Viéte relations to 
conclude that g, and thus f, has a complex root z of modulus less than |. Then, 
substitute the expression for f in the equality f(z) = 0 and use the hypotheses 
on the coefficients of f to reach a contradiction. 

.» Zn be the complex roots of f. If|z;| < 1 for some 1 < j <n, then 


lao] = |aizj + +++ + anz)| 
< lai|lzj| +--+ lanl |z;\" 
= |ai| Feo lanl, 


which is an absurd; therefore, |z;| > 1 for 1 < j < n. Now, suppose that 
f = gh, with g,h € Z[X] \ Z, say g(X) = bo + 1X +--+ + b,-X” and 
A(X) =co+c.X +--+: +c5X*, withr,s > 1 and b-, cs 4 0. By renumbering 


the z;’s, if necessary, we can assume that the roots of g are z1,...,z,. Then, 
Girard-Viéte relations give 
|bo| = |brl|zil--- [Zr] > |r| and |co| = |es||Zr-+il--- lZnl > les|- 


Hence, |bo| => |b;| + 1 and |co| < |cs| + 1, so that 


lao| = |bo 


Icol = (brl + D(les| + 1) 
les| + [br] + les] +1 


Ics| +27 [by ||cs| + 1 


br||¢s| + 17 


n| +1). 


This way, /|ao| > /|dn| + 1, which is an absurd. 
For item (a), use an argument analogous to that of the proof of Lemma 17.1. 
For item (b), suppose that f = gh, with g,h € Z[X] \ Z. Then, it follows from 


(a) that all of the coefficients of g1(X) = g (x +m-— 5) have a single sign. 


Hence, g2(X) := g1(—X) has nonzero coefficients of alternating signs, so that 
|g2(x)| < |g1(x)| for every x > 0 and, thus, 


1 1 
g(-x+m-=) < g(x+m—5) 
for every x > 0; in particular, |g(m — 1)| < |g(m)|. Conclude that |g(m)| > 2 
and, analogously, |h(m)| > 2. Finally, use the fact that f(m) is prime to reach 
a contradiction. 


Combine Poélya-Szegé’s theorem (cf. previous problem) with the result of 
Problem 9, page 386. 
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Section 20.1 


1. If F(X) = ay, X" +--+ +a, X +a9 € QUX ee look at 
g(X) = ex" + XML. tee —X +49 € QIK 1\Q. 
r 


2. The first de Moivre’s formula gives (cos thn + isin =AE =7)" = 1. Expand the 


2 2h ae cos2 2k = 1to 
2kn 2k 
n 


binomial at the left hand side and apply ‘identity sin 


find nonzero polynomials with rational coefficients having cos —= and sin 


as roots. Alternatively, for cos * ak. use the result of Example 15.27, together 


2h = 2cos* A — 1 aad Theorems 20.9 and 20.12; for sin 2k use 
oka 


with cos — 
Tgonomeny: together ih the fact that cos —* 

. Apply Eisenstein’s criterion (cf. Corollary 19. 27). 

4. First use Gauss’ Theorem 19.14 to show that there exists g € Z[X] \ Z, 
irreducible over Z and such that g(a) = 0. The, use Gauss’ Lemma 19.13 
to conclude that g is also irreducible over Q, so that g = py. 

5. For item (a), apply Remark 20.10 several times, together with the fact that 
Pyaj(X) = X — /aj or = aj, according to whether ./aj € N or ,/aj ¢ N. 
For (b), use (a) and the rational roots test (cf. Proposition 15.16). 

6. Problem 2, page 475, shows that f is irreducible over Q. Now, suppose that 
there exist relatively prime integers a and b and a natural number n > | such 


that f(a) = 0, with a = "/|4|. Let 


g(X) = bX" —a =b(X — a)(X — aa)... (X — aw"), 


is algebraic. 


1S) 


with w = cis = Corollary 20.4 shows that f = pq; hence, item (b) of 
Proposition 20. 3 ‘guarantees that f divides g in Q[X] and, hence, in Z[X] (for 
f € Z[X] is monic). Thus, it follows from Problem 4, page 386, the existence 
of integers 1 < k <1 <n -—1 such that 


f (X) = (X — a)(X — aw") (X — aa") (X — aa!)(X — aa’). 
By examining the independent term of f, conclude that —a> = 2 and, then, 
a = —</2. How, arrive at a contradiction. 


7. Ifa = a,..., @m are the roots of py and B = fj, ..., By are those of pz, the 
natural candidate to be looked at is 


m 1 
h(X) = I] (X — a Bj) = (1 ...m)" | | pp (=x) ; 
i=l : 


l<i<m 
l<j<n 


Now, note that a] ...d@m = +pq(0) € Q. 
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. Item (a) is almost immediate. For multiplication, start by taking f, g € Q[X] \ 


{0} and writing fg = pag +r, withr = O or dr < dpg =n; then, show that 
f(a@)g(a) = r(q@), so that f(a)g(a) € Q(a@). For (b), let 8 = f(a) = g(a), 
with f, g € Q[X] such that f = 0 or Of <n — 1 and likewise for g; note that 
(f — g)(a) = 0, and successively conclude that py | (f — g) and f—g=0. 
Finally, for (c), show that gcd(f, pw) = 1 follows from the irreducibility of 
Pa. Therefore, Corollary 19.4 gives fg + pag = 1, for some g,g € Q[X] 
satisfying g = O or 0g < 0py =n and gq = Oor dq < Of. Then, evaluate this 
equality at a to show that 3 € Q(a). 


. For the first part, note that P 33x(X) = X3 — 2, so that Q(/2) = {a+ bx/2+ 


cv/4; a,b,c € Q}. The second part can be solved following the steps in the 
hint given to Problem 8, page 200. Alternatively, follow the steps listed in the 
hint given to item (c) of the previous problem; more precisely, letting f(X) = 
1+ X +2X? (so that 14+ /2+2,/4 = f (</2)), use Euclid’s algorithm to find 
g.q © Q[X] such that fg + p y5q = 1 and then note that f (/2)g(/2) = 1. 
Start by showing that ¢(0) = O for any such ¢. Then, use (ii) to show that 
#1) = Oor dC) = 1. If ¢C1) = 0, conclude that @¢ = 0; if ¢(1) = 1, show 
successively that 6(m) = m for every m € Zand ¢(r) = r for every rr € Q. 
Now, letting py (X) = X" + Gn X"—! +--- +a,X +p, note that 


0 = 6(0) = o(pala)) = b(@)" + dni (@)""| +++» + a1 (@) + a0, 


so that 6 = $(q) is also a root of py. In turn, show that, for any f € Q[X] \ {0} 
satisfying df < n — 1, we have d(f(a)) = f(d(a@)) = f(A), so that ¢ is 
completely determined by the fact that ¢(@) = f. Conversely, for any such 


root B of py, let d : Q(a) — be given by (f(@)) = f(B), for any f € Q[X] 
satisfying f = Oordf <n-— 1. It is straightforward to show that (i) holds. For 


(i1), let f, g € QLX] \ {0} be given and write fg = pyg +r, withg,r € Q[X] 
such that r = 0 or dr < n — 1. Then, show that 
o(f(@)g(a)) = O((fg)(@)) = (r(@)) = r(B) 
= f(B)g(B) = o(f(a))b(g(@)). 


Section 20.2 
Use item (a) of Proposition 20.14 forn = p*~! andn = p*. 
A typical root of ®2, is @ = cis en for some integer | < k < 2n such that 


2, 


gcd(k, 2n) = 1. Now, use the fact that —w is also of this form to conclude that 
—a is aroot of ®2, too. Then, show that ®2, is a product of factors of the form 
(X —@)(X +o). 
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3. If ®,, and ®, have a nonconstant common factor in C[X], then, being the 
minimal polynomials of any of its common roots, they are identical. Then, letting 
®,, = ®, = f, item (a) of Proposition 20.14 shows that f 2 divides X”"” — 1, 
which is impossible. 

4. Firstly, with n and d as in the statement, note that y(n) = g(d) - ai so that 


ID, = Of, where f (X) = &g(X"/“). Now, if @ = cis **, with 1 < k < d and 
gcd(k, d) = 1, andy € Cis an 7th root of w, show that 7 is an n-th primitive 
root of unity. Finally, conclude that ®, and f have the same roots. 

5. Dirichlet’s theorem allows us to choose d € N tal such that 1 + kd = p, a prime 
number. Now, look at the arithmetic progression th - Dr cal aes eo 

6. The general version of Dirichlet’s theorem assures that the arithmetic progression 
(n+ 2,2n + 2,3n +2, ...) contains infinitely many primes. Letting p = kn + 
2 > a be one such prime and g be a primitive root modulo p, the fact that 
{g, g7,..., g?~|} is a RRS modulo p assures the existence of an integer 1 < 
t < p—1such that g‘ = a (mod p). Now, apply Bézout’s theorem to guarantee 
the existence of natural numbers u and v such that nu = t + (p — 1)v, so that 
(g")" = a (mod p). 

7. Firstly, consider the case of an odd a. Since a is not a perfect square, there 
exist natural numbers b and f, and distinct odd primes qj, ..., g; such that 


a = b’q,...q,. Then, Proposition 12.23 guarantees that it suffices to find 


infinitely many prime numbers p for which (4) mae («) = —1. In turn, 


Quadratic Reciprocity Law assures that such a relation is equivalent to 


rl (Zee, 


jai \4/ 


with s = yy qj. Now, apply Dirichlet’s theorem, choosing p = 4kq1 ...q; + 
1. 

8. Let A = a* — 4b. Multiplying both sides of the given equation by 4 and 
completing squares, show that the hypotheses of the problem guarantee that the 
quadratic congruence x” = A (modn) has a solution for every n € N. If A is not 
a perfect square, write A = a7, with a, B € Z and |a| > 1 being square free. 
Then, use the result of the previous problem to reach a contradiction. 


Section 20.3 


1. We parallel the proof of the cancellation law for addition: 


a-c=b-cS(a:c):c '=(b-0)-c! 


sa-(c-c)=b-(c-c"4) 


>a-l=b-1>a=b, 
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2. Ifa?" =a and p”” = B, then 


(a+ py” =a? + ph =at B, 


so thata + B € R,. Argue in the same way to show that wB € R, and (if a 4 0) 
a! € R,. Then, show that R,, is a field. 


. For the first part, use (20.10). For the second, use (20.9), together with the fact 


that f11(X). -» flay (X) = X? — X and 
XP’ _ xX =(X?)P _XP4XP_X 
= (xX? =X)?» xP = xX 
= (X? — X)((x? — xP! +1). 


. Certainly every f € K[X] of degree 1 is irreducible. Now, if fi, fo,..., fx € 


KK[X] are irreducible (there are at least two such polynomials, X and X — 1), let 
ft = fifo... fe + 1, choose an irreducible factor g €¢ K[X] of f and show that 


SA Sis frr---s Sk 


Section 20.4 
1. Note that the decimal expansion of a is not periodic. 
2. For the sake of contradiction, ifaw + 6 = r, withr € Q, and f(a) = 0 for 


f (X) = ay X" +--+ +a,X +. a9 € Q[X] \ {0}, then 
an(r — BY" +++» +ay(r — B) +a9 = 0. 


Use this equality to show that 8 would be algebraic. 


. If 2/a was algebraic, then several applications of Theorem 20.12 assure that the 


same would be true of (v/a) = = a. If a” were algebraic, and f € Q[X] \ {0} 
satisfies f(a”) = 0, find g € Q[X] \ {0} such that g(a) = 0. 


. For the second part, if Q[a@] was a subfield of C there would exist n € N and 


ao, 41,---,4n € Q such that a, 4 0 and x =agtaja+---+a,a". Show that 
this would force a@ to be algebraic. 


. Leta € R and, for 1 < k < 2016, let Ax be the point of the cartesian plane 


who coon are the real and imaginary parts of (a + ia*)o*—!, with w = 
cos ae +isin we. Then A; A2...A2016 is a regular 2016-gon, and it suffices 
to show that it is possible to choose @ in such a way that Ay ¢ S for every k. 
Let I be one of the circles in S$, with center O = (a, b) and radius r, so that 


: Wk-la - 2k-l)a 
a,b,r € Q. Letting cos MIG = CE and sin MIE = 5, we have 


(a+ a i)(c + si) = (ca — sa’) + (sa + ca’)i. 
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Then, Ay € 2 & A,O =r or, which is the same, if and only if 
((cox — sa’) — a)” + ((sa + ca’) — b) —r?=0. 
In turn, this last equality is equivalent to 
at + (1 — 2be + 2as)a? + (2ac — 2bs)a + (a? +b* —r) = 0. 
However, since c and s are algebraic over Q (cf. Problem 2, page 486), we 


conclude from the above remark that w should be algebraic over Q. Therefore, 
by choosing a € R to be transcendental, we conclude that Ax ¢ S for every k. 


Section 21.1 


1. For every integer k > 0, set f(x) = aye sin.x + bye cos x, so that 
f** YD (x) = —(az + dee sin x + (az — bye cos x 


and, hence, ax41 = —(ax + bx), be41 = ax — be. Then, conclude that bg42 + 
Qbe+1 + 2by = 0 for k > 0. 

2. The characteristic polynomial of the involved recurrence relations is f(X) = 
kx* — xk-l _...— X — 1. Ifa € C is aroot of f such that |w| 4 1, we have 


Klo:|* = lok! +---+041 


<|o[AT+-+-+|al+1 
_ rai 
fee] =a" 


so that kla|‘+! — (k + l)la| + 1 < 0. Study the first variation of g(X) = 
(X — 1) f(X) = kX*t! — (k +1) X* + 1 to conclude that |a| < 1. Ifa € Cisa 
root of f such that |w| = 1, review the above estimates to obtain 


k=klal|* = |o* 1 +--- +04] 
<|afP*+-.-+lal+1=k 


and, hence, to conclude (from the condition for equality in the triangle inequality 
for complex numbers, cf. Problem 3, page 326) that a/ = A;-1, forsomeaj; > 0. 
Up to this point, we have shown that, apart from 1, all a the other conmples 
roots of f have modulus less than 1. Moreover, they are all simple roots, for 
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g!(X) = k(k + 1)X*—!(X — 1) implies that the only multiple root of g is 1, with 
multiplicity 2. Now, use Theorem 21.3 to obtain, for m > k, 


-1 =j 
Xm = Uy +uas +--- + ugar, 


with a} = 1, a2, ..., a, being the complex roots of f. Then, conclude that 
Xm ii 1. In order to compute wu 1, note that 


uy tuoa+::-+up=ay, 
Ul + UIA + +++ FURR = a2 


k-1 


k-1 
uj + U2Q> = 


Tr PUR | = ak 


and f (X) = (X — 1)h(X), with f(X) = (X — 1)(kX*—! 4+ (k- 1) X47 4. + 
2X + 1). Finally, in the above system, multiply the j-th equation (from top to 
bottom) by 7 and add the results to reach 


k(k +1) 


k 
5 ui + Yujh(aj) = kag + (k - Vap-1 +--+ + 2a2 + a4. 


j=2 


Argue analogously to yj. 

3. If f(X) = X? —3X? + 1, start by showing that f has real roots a > b > c, such 
that — <c< -> £ <b< i and0Q < b”+c” < | for every integer n > 2. 
Ifa, =a" +b" +c" forn > 1, show that aj, a2, a3 € Zand ag43 = 3ag42 — ag 
for k > 1; then, conclude from what we did above that |a” | = a, — 1. Finally, 
use the linear recurrence relation satisfied by the sequence (dy, ),>1 to show that 
ak+17 = ax (mod 17); alternatively, invoke the result of Problem 9, page 469. 


Section 21.2 


1. For the second part, suppose that |z| > R and let € = |z| — R. Use the definition 
of convergence to find ann € N such that |z, —z| < €, and deduce that |z,| > R, 
which is a contradiction. 

2. Use the result of the previous problem. 

3. Adapt the reasoning presented in the proof of Proposition 21.15. 

4. Apply the result of items (a) and (b) of the previous problem to the sequence of 
partial sums of the series 7... | (azz + bux). 

5. Adapt, to the present case, the proof of Proposition 3.7. 

7. Start by observing that, if (Z,)n>1 is a sequence in X such that z, — z, with 
z € X, then the triangle inequality gives || f(zn)| — | f(2)|| < |f(n) — FI. 

8. Start by observing that, if (Zn)n>1 iS a sequence in X such that z, — z, then 
[Zn — z| < B for every sufficiently large index n; hence, | f(zn) — f(z)| < 
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A|Zn — z|, also for every sufficiently large index n. Now, given € > 0, note that 
lf (Zn) — f(2)| < € if |Zzn — z| < a then, use the convergence of (Z7)n>1 to z. 

9. Case m = | is the content of Example 21.12. For m = 2 and |z| < ae we get 
from the initial case that 


5 — =| rate) [ald | = att! 


k>0 1>0 k,1>0 
= Yia + 1)a"z". 
n>0 


By induction, if 


1 n+m—2\ ,, 
azar L( m—2 Janz 


then 
1 ee 1 
(l—az)™ ~~ 1—az (l—az)r-! 


where, in the last equality above, we have used the columns’ theorem of Pascal 
triangle (cf. Proposition 4.17 of [8]). 


Section 21.3 


1. Use Theorem 21.22, together with the fact that the characteristic polynomial of 
the given sequence is 


k 
x**_oxk 41 = (x*-1° =] [(X-0!)’. 
j=l 


2. An easy inspection shows that the characteristic polynomial of (ay,)n>1 is (X — 
1)(X — 2). Thus, Theorem 21.22 gives ay = A+(B+C(n-— 1))2”~!, for some 
real constants A, B and C. Since at — 5(B + C(n — 1)), we must have C = 0 
and 4 = 3. Finally, a; = 1 gives A+ B = 1. 


Glossary 


Problems tagged with a country’s name refer to any round of the corresponding 
national mathematical olympiad. For example, a problem tagged “Brazil” means 
that it appeared in some round of some edition of the Brazilian Mathemati- 
cal Olympiad. Problems proposed in other mathematical competitions, or which 
appeared in mathematical journals, are tagged with a specific set of initials, as listed 


below: 


AIME 

APMO 
Austrian-Polish 
BMO 

Crux 


EKMC 

IMO 

IMO shortlist 
Miklos-Schweitzer 


American Invitational Mathematics Examination 
Asian-Pacific Mathematical Olympiad 

Austrian-Polish Mathematical Olympiad 

Balkan Mathematical Olympiad 

Crux Mathematicorum, a mathematical journal of the Cana- 
dian Mathematical Society 

E6tvés-Ktirschak Mathematics Competition (Hungary) 
International Mathematical Olympiad 

Problem proposed to the IMO, though not used 

The Miklés-Schweitzer Mathematics Competition (Hun- 


gary) 


OCM State of Ceara Mathematical Olympiad 

OcS South Cone Mathematical Olympiad 

OBMU Brazilian Mathematical Olympiad for University Students 

OIM Iberoamerican Mathematical Olympiad 

ORM Rioplatense Mathematical Olympiad 

Putnam The William Lowell Mathematics Competition 

Saint Petersburg Mathematical competition of the city of Saint Petersburg, 
Russia 

TT The Tournament of the Towns 
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